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ABSTRACT 

This  thesis  studies  two  topics  in  the  theory  of  nonlinear 
filtering;  the  use  of  multiple  stochastic  integrals  to  analyze  filters, 
and  the  use  of  Lie  algebraic  and  operator-theoretic  techniques  to 
discover  new,  fini te-dimensional ly  solvable  filtering  problems. 

The  main  results  of  the  multiple  integral  techniques  are: 

1.  A  simpler  and  more  insightful  proof  of  a  result  of 
S.  Marcus  on  filtering  polynomials  functions  of  a 
Gauss-Markov  process. 

2.  A  formula  for  representing  the  product  of  two  multiple 
integrals  as  a  sum  of  multiple  integrals,  thus  providing 
a  rudimentary  calculus  of  multiple  integral  expansions. 

3.  An  expansion  of  the  optimal  mean  square  filter  as  a 
ratio  of  two  multiple  integral  expansions. 

4.  Integral  equations  for  the  kernels  of  the  best  mean 
square  filter  of  the  class  of  (finite)  rth  order  multiple 
integral  expansions. 

The  problem  of  estimating  a  diffusion  process  observed  in 
white  noise  is  studied  with  Lie  algebra  techniques.  Necessary  con¬ 
ditions,  and  in  the  scalar  case,  necessary  and  sufficient  conditions, 
are  given  for  estimation  algebra  finite  dimensionality.  Examples  of 
scalar  problems  with  fin.  dim.  estimation  algebras  are  discussed,  and 
it  is  shown  that,  from  among  them,  no  new  cases  exist  for  which  lakai’s 
equation  can  be  solved  by  a  Wei -Norman  type  method. 
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CHAPTER  1 :  INTRODUCTION 
1.1  The  Nonlinear  Filtering  Problem 


Nonlinear  filtering  theory  is  the  study  of  a  broad  range  of 
problems  in  the  estimation  of  stochastic  processes.  A  typical 
example  concerns  the  estimation  of  a  signal  in  additive  noise.  In 
this  situation,  one  is  interested  in  the  properties  of  a  stochastic 
process  { x ( t ) { t >0 }  called  the  signal.  One  might  want  to  know,  for 
instance,  the  value  of  f(x(t)),  for  a  function  f,  or  even  the 
value  g(x(s)$<s<t)  for  a  functional  of  the  past  of  x(*)  up  to  time  t. 
However,  information  about  x(-)  is  available  only  through  observation 
of  the  process 


h(t) 


ft 

h(s ,x(s) )ds  +  w{ t) 
JO 


(1.1) 


in  which  h  is  a  given  function  and  w(t)  is  a  "noise",  usually  an 
independent  increments  process.  Thus  estimates,  or  rather,  filters, 
for  f(x( t) )  and  g(x(s ) ;0<s<t)  must  be  constructed  from  (y(s) jO<s<t}. 


Minimization  of  the  mean  square  filtering  error  is  the 
criterion  generally  chosen  to  guide  filter  design.  Thus,  in  linear 
filtering  theory  the  goal  is  to  produce  the  best  (mean  square) 
estimate  that  is  a  linear  functional  of  iy(s) jO<s<t}.  Nonlinear 
filtering  theory  goes  further;  it  asks  for  the  best  mean  square 
estimate  given  the  past  of  y(-).  If  Ef2(x(t))<*>  and  if  denotes  the 
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cy  -algebra  a{y (s)  jO<_s<t} ,  it  is  well  known  that  this  estimate  is 
Tt(f)  =  E{f(x(t))!Fp.  The  goal  of  nonlinear  filtering  is  to  com¬ 
pute  or  to  approximate  this  conditional  expectation. 

Interest  in  filtering  problems  stems  from  their  central  role 
in  several  applied  subjects.  For  example,  in  the  theory  of  com¬ 
munication  (Van  Trees  [48]),  (1.1)  is  a  common  model  for  a  signal 
sent  in  a  noisy  channel;  successful  transmission  of  information  re 
quires  extracting  the  signal  from  the  noise.  It  may  also  be 
necessary  to  decide  on  the  basis  of  ty(s);s<t}  between  two 
possibilities,  h(s,x(s))eO,  s<t,  or  h(s,x(s))  =  a  given  signal. 
This  "signal  detection"  problem  is  closely  related  to  optimal 
filtering  (E.  Wong  [46]).  Stochastic  control  problems,  in  which 
a  control  is  to  be  chosen  so  as  to  influence  signal  process  be¬ 
havior,  can  also  involve  filtering  if  the  control  is  allowed  to 
depend  on  noisy  or  partial  observations  of  the  signal  (see 
Fleming  and  Rishel  [14]  and  references  cited  therein.). 

The  modern  literature  of  nonlinear  filtering  begins  with  the 
contribution  of  Kalman  and  Bucy  [24],  who  formulated  and  solved  th 
model  (1.1)  for  the  case  in  which  x(t)  is  a  Gaussian  diffusion, 
h(s,x)  is  a  linear  in  x,  w(t)  is  Brownian,  and  f(x(t) )=x( t) . 

Their  main  result,  to  be  stated  in  lemma  3.1,  proves  that  the 
conditional  density  of  x(t)  given  is  Gaussian  and  provides  a 
method  to  compute  the  conditional  mean  and  covariance  recursively. 
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For  few  other  cases  is  such  a  complete  and  easily  constructed 
solution  available.  However,  two  very  powerful  characterizations 
of  optimal  filters  are  known  to  hold  in  quite  general  situations. 
The  first  is  a  8ayes-type  formula  for  m  (f),  which  is  due  to 
Kallianpur  and  Striebel  [22]  and  which,  in  essence,  represents 
Tt(f)  by  a  functional  integration  in  process  path  space  (see 
Section  1.3).  It  is  valid  for  Brownian  noise  w(t)  with  minimal 
restrictions  on  x(*)»  h,  and  f.  When  the  signal  is  Markovian, 
m*(f)  can  be  further  characterized  as  the  solution  of  a  stochastic 
differential  equation  (Fujisaki,  Kallianpur  and  Kunita  [15]).  In 
general,  r+(f)  cannot  be  found  from  this  result  because  the  co¬ 
efficients  of  the  filter  equation  involve  optimal  estimates  of 
<j>(x(t))  for  functions  ?  different  from  f.  Thus  additional 
equations  are  required  to  compute  n^(?)t  which  in  turn  require 
estimates  of  yet  other  functions  of  x(t).  The  resulting  system 
of  equations  is  in  general  infinite-dimensional.  The  cleanest 
formulation  of  this  infinite  dimensionality  is  Zakai's  [47] 
stochastic  partial  differential  equation  for  an  unnormalized 
version  of  the  conditional  density,  assuming  this  density  exists 
(see  Chapter  5).  Finally,  several  very  recent  developments 
promise  new  insights.  V.  Benes  [1]  has  derived  new  examples  of 
explicitly  solvable  filtering  problems,  and  Brockett  and  Clark 
[17],  Brockett  [5],  and  Mitter  [35]  have  begun  applying  Lie 


algebraic  and  operator  techniques  to  the  study  of  conditional 
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density  equations.  These  developments  will  be  discussed  in 
Chapter  5. 

The  above* briefly  outlined  results  constitute  the  principal 
highlights  of  nonlinear  filtering  theory,  but,  despite  their 
mathematical  depth,  they  remain  incompletely  developed.  For 
many  common  filtering  problems  little  is  actually  known  about  the 
filter  structure  and  one  must  resort  to  reasonable,  but  ad  hoc 
techniques.  A  powerful  and  general  theory  for  building, 
analyzing  and  comparing  suooptimal  designs  does  not  exist. 

1  .2  Summary  of  Thesis 

This  thesis  studies  two  different  ideas  for  analyzing  non¬ 
linear  filtering  problems.  The  first  is  that  of  evaluating  or 
approximating  filters  by  expansions  in  series  of  multiple 
stochastic  integrals.  Such  an  approach  is  motivated  by  the  fact 
that  the  optimal  estimate  ^(f)  may  be  thought  of  as  a  functional 
Ft(y( * ) )  of  the  observation  orocess.  It  is  then  possible  to 
explore  mt(f)  within  the  framework  of  a  representation  theory  for 
F,  for  instance,  one  that  expands  F  in  a  series  of  simpler  and 
more  easily  manipulated  basis  functionals.  Multiple  integrals 
are  ideally  suited  for  this,  because  they  are  easy  to  handle  and 
because  they  can  represent  a  large  class  of  functionals  F 
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(see  theorem  2.2) . 

The  second  idea  differs  from  the  first  in  method  and  style. 
Rather  than  expansion  or  approximation,  it  studies  the  question 
of  when  a  filtering  problem  can  be  solved  in  an  exact,  finite 
dimensional  manner.  And  rather  than  being  probabilistic,  the 
techniques  are  algebraic  and  operator-theoretic.  Brockett  and 
Clark  [7],  Brockett  [5,6]  and  Hitter  [35]  have  shown  recently 
that  certain  Lie  algebras  of  operators,  called  estimation  alge¬ 
bras,  can  be  associated  to  the  problem  of  filtering  a  Markov 
process  observed  in  white  noise.  In  examples  with  known,  finite 
dimensionally  computable  conditional  densities,  that  is,  the 
examples  of  Kalman  and  Bucy  [24]  and  of  Benes  [1],  the  estimation 
algebra  is  also  finite  dimensional.  Conversely,  it  is  widely 
conjectured  that  given  appropriate  hypotheses.  Lie  algebra 
finite  dimensionality  will  imply  the  existence  of  a  finite 
dimensionally  computable  expression  for  the  conditional  density. 
This  suggests  the  strategy  taken  up  in  the  second  part  of  the 
thesis  research:  seek  all  problems  with  finite  dimensional 
estimation  algebras  and  try  to  solve  them. 

The  main  results  of  our  investigation  are  presented  in  the 
following  chapter  by  chapter  summary  of  the  thesis.  Chapter  2 
defines  the  multiple  stochastic  integral  and  develops  some  of  its 
fundamental  properties.  The  main  result  here  is  the  multipli- 


cation  formula,  (theorem  2.4,  Section  2.3),  which  describes  how 
to  re-expand  the  product  of  two  multiple  integrals  as  a  sum  of 
multiple  integrals  and  which  is  an  important  calculational  and  con¬ 
ceptual  tool  in  the  theory  of  Chapter  4.  Technical  lemmas  needed 
in  Chapters  3  and  4  are  also  stated  and  proved. 

Chapter  3  contains  a  proof  of  a  result  originally  due  to 
S.  Marcus  [29]  on  the  finite  dimensional  solvability  of  filters  for 
estimating  polynomial  functions  of  a  Gauss-Markov  signal  process 
given  linear,  but  noisy  observations.  The  proof  here  sets  the 
problem  in  the  context  of  Gaussian  process  theory  by  using 
multiple  integrals  and  homogeneous  chaos  theory.  It  is  simpler 
than  Marcus'  original  proof  and  explains  more  clearly  how  and  why 
a  finite  number  of  statistics  characterize  the  optimal  estimate. 

This  work  was  done  jointly  with  S.  Marcus  and  S.  K.  Mitter. 

In  Chapter  4  we  present  expansion  theories  for  the  general 
filtering  model  of  estimating  a  signal  in  white  Gaussian  noise. 
First,  we  derive  a  representation  of  the  full  optimal  filter  as 
a  ratio  of  multiple  integral  expansions.  In  effect,  this 
representation  evaluates  the  functional  integrals  of  the  Kallianpur- 
Striebel  formula  with  multiple  integrals.  Secondly,  we  pose  a 
basic  probl em, sugges ted  by  the  multiple  integral  idea,  for  the 
design  of  suboptimal  filters:  For  any  r,  what  is  the  best  (mean 
square)  estimator  having  the  form  of  an  rth  order  multiple 
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integral  expansion?  Using  the  expansion  representation  in  con¬ 
junction  with  the  mul tipi ication  formula,  we  derive  integral 
equations  for  the  kernels  of  a  best  rth  order  estimate.  We  then 
rederive  the  Kalman-Bucy  filter  and  discuss  the  case  r=2  as  examples 
of  the  technique. 

Chapter  5  discusses  the  Lie  algebra  approach  to  finite 
dimensional  filter  computation.  The  main  results  are  presented  in 
Section  5.3.  For  vector  diffusion  signals  with  non-singular,  constant 
local  covariance,  a  fairly  restrictive  necessary  condition  is  given  for 
estimation  algebra  finite  dimensionality.  In  the  scalar  case,  this 
allows  all  possible  problems  with  finite  dimensional  estimation 
algebras  to  be  listed.  A  solution  of  some  of  these  filtering  problems 
is  then  attempted  using  a  method  developed  and  discussed  in  Sections 
5.1  and  5.2.  The  result  is  that  only  those  previously  known  examples 
of  Benes  can  be  solved  finite-dimensionally  by  this  method. 

It  is  worth  remarking  that  the  last  chapter  is  discursive  in 
style  and  does  not  present  a  complete  theory.  This  chapter  is 
a  preliminary  report  and  discussion  on  calculations  of  interest  to 
a  new,  developing  theory  with  important  implications.  To  shorten 
the  exposition  and  concentrate  on  the  main  idea,  we  have  omitted 
certain  cases  from  the  analysis,  but,  as  shall  be  mentioned,  the 


A 
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results  extend  formally  to  them.  In  this  regard,  work  remains 
to  be  done  in  building  more  complete  results.  However,  we  feel 
the  ideas  are  sufficiently  well -developed  and  interesting  to  merit 
inclusion. 


The  Fundamental  Problem:  Mathematical  Prerequisites 


The  work  of  this  thesis  employs  techniques  from  a  variety 
of  fields.  Chapters  2-4  assume  familiarity  with  certain  elements 
of  stochastic  process  theory,  in  particular,  stochastic  inte¬ 
gration  with  respect  to  Brownian  motion,  stochastic  differential 
equations,  and  Ito's  rule.  Chapter  3  uses  some  homogeneous 
chaos  theory,  which  is  summarized  briefly  in  Appendix  1.  Finally, 
Chapter  5  requires  familiarity  with  the  use  of  Lie  algebra/ 

Lie  group  methods  in  systems  theory  and  with  the  theory  of  self- 
adjoint  operators.  Appendix  2  states  the  basic  definitions  and 
results  that  are  needed  from  operator  theory. 


We  will  adopt  the  following  conventions  throughout  the 
thesis:  all  Brownian  motions  are  assumed  to  have  mean  zero  and 
unity  scale;  if  { z ( t ) 1 1 >0 >  is  a  stochastic  process, 

F*  =  a { z ( s ) | s  <t }  denotes  the  --algebra  generated  by  z(s)  for 


s<t . 
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We  now  state  the  precise  filtering  problem  to  be  considered  in 
the  thesis.  Let  { x ( t ) | t  s  [0,T]}  be  a  measurable  real-valued  process  on  a 
probability  space  (:.,F,P).  Let  h(s,x)  be  a  Bore!  measurable  function. 

Set 

y(t)  »  [  h(s,x(s))ds  +  w(t)  t  e  [Q,T]  (1.2) 

h 

and  assume 

(T)  w  is  a  Srownian  motion  independent  of  the  signal 
process  x(  ) 

rT  ? 

(ii)  E  i  h ~ ( s ,x(s) )ds  <  - 
'0 

Defini tion  1,1,  A  process  {y ( t ) j  t  e  [0,T]}  defined  by  1.2  satisfy¬ 
ing  the  stated  assumptions  is  called  an  observation  semimarginqale. 
Given  a  functional  f(t;x(s) ,s<t)  of  the  past  of  x(-)>  we  want  to 
compute  the  optimal  mean  square  estimate 

*t(f)  *  Elf(t;x(s) ,s<) , F^;  . 

The  following  theorem  of  Kallianpur  and  Striebel  [22]  will 
be  a  principle  theoretical  tool  of  this  thesis.  For  a  good 
exposition,  see  Wong  [46]. 

Theorem  1 .1  (Kallianpur,  Striebel).  Let 

dP  T  J 

~  =  exp  [-  |  h ( x ( s ) ) dw ( s )  -  j  ;  h2(x(s);ds]. 

0  Jo 


Then  (i)  Pq  is  a  probability  measure,  P  and  Pq  are  mutually 

absolutely  continuous,  and 

T  T 

-  exptj  h(x(s))dy(s)  -  \  ^h2(x(s))ds] 

t  t 

( 1 1 )  Eo^^5— i  *y }  s  exPt[  h(x(s))dy(s)  -  \  h2(x(s))ds]. 

u  aKo  z  jo  h 

(iii)  W.r.t.  PQ,  y(-)  is  a  Brownian  motion  independent  of  x ( * )  - 

(iv)  x(0  has  the  same  law  w.r.t.  Pq  as  w.r.t.  P. 

(v)  E{f(t;x(s) ,s<t) | 

EQ{f(t;x(s),s<t)J£-|F*} 

Finally,  the  concept  of  innovations  will  occasionally  be 

needed. 

Definition  1 .2.  The  innovations  process  associated  to  the  filtering 
problem  of  (1.2)  is 

ft 

v(t)  *  y(t)  -  rr  (h(s,x(s))  ds 

JO  s 


Interestingly,  given  mild  restrictions  on  the  nature  of  x(*)  and  h(s,x), 
v(t)  is  a  Brownian  motion  (Lipster  and  Shiryayev  [28]). 
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CHAPTER  2  MULTIPLE  INTEGRALS 

This  chapter  will  define  multiple  stochastic  integrals 
with  deterministic  kernels,  discuss  their  basic  properties,  and 
establish  both  theoretical  and  technical  results  that  are  important 
in  filtering  applications.  The  main  result  of  this  chapter  is  the 
multiplication  formula  of  theorem  2.4  in  section  3. 

2.1  Definition  and  Basic  Properties  of  Multiple  Integrals 

This  section  is  devoted  to  a  brief  exposition  of  the 
multiple  Wiener  integral  and  its  elementary  properties.  Most  of  the 
material  is  well  known  and  is  due  to  Ito  (  20  ) ,  who  developed  the 
definition  in  its  present  form  and  demonstrated  its  connection  to 
homogeneous  chaos  theory.  In  addition,  we  prove  some  technical 
results,  including  a  construction  to  produce  multi  Die  integrals 
recursively  from  stochastic  differential  equations,  important  in 
subsequent  work. 

Let  {b ( t ) }t>g  be  a  Brownian  motion,  and  let 

Ft  5  a { b ( s ) | s <t }  denote  its  associated  family  of  sub-r-al gebras . 

If  $(s,u)  is  a  measurable  random  process  adapted  to  r*,(i.e., 

,T  *- 

a ( t; ,•)  is  F„  -measurable  for  every  t),  and  if  E  :-(s)  ds  <  », 

J0 

then  for  t  <_  T  we  can  define  the  measurable,  adaDted  process 

4> 

U 

r 

s(s)db(s;; 

*0 
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see,  for  instance,  Ltotser  and  Shi ryayev  [28  ].  Recall  the 
properties  of  this  integral 


EC 


♦(s)db(s)]  =  0 


0 

t 


(2 


r 


E  |  ?(s)db(s)  j  ^(s)db(s) 
0  0 


r 


E?(s)v(s)ds 


We  will  use  this  single  integral  to  define  multiple  integrals  by 
iteration,  a  technique  different  than  Ito's,  but  equivalent  in 
result  up  to  a  multiplicative  constant. 


Definition  2.1 


2  ^  v* 

(i)  f  s  L  ([0,1]  )  is  severable  or.  Si  >_ 


if 


N 


^(s1,---,sr)  =  J  41'(s1)-*-vJ.7'{sr)  for 


T  i  si  >  s2  -  ***  -  V 

(ii)  ^([0,T]r)  =  {fs  L“(  [0,T]r]  i  f  is 

(a)  separable  on  s-|  >.  •••  ^  sr,  and 

(b)  symmetric] 

(iii)  *L2([3,7lr)  =  •;f-:L2[D,"r[r  f  is  symmetric-. 


s 
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Definition  2.2  Let  f  -  ( CO ,Tj r) ,  f  =  y  ^  (s-j )  •  •  *y r(sr)  on 

~h 

S-j  >  •••  >_  S  .  The  ?'  multiple  stochastic  integral  of  f  up 
to  time  t  is  defined  inductively  by 


Ij(f)  = 


,r-l 


y-j(s)  I$"  (Y2,**'fr)db(s) 


(2.2) 


,  *1, 


and  I^(f)  is  defined  on  all  of  Lg([0,T]  )  by  linear  extension 
Also,  we  adopt  the  convention,  Iy(f)  =  f. 


Note  that  (2.2)  is  meaningful,  because  at  each  step, 
using  separability  and  induction,  y^(s)  I$  (Y2”*Yr)  is-a  measurable, 
Ft-adapted  process  and  hence  may  be  integrated. 

Theorem  2.1  For  f,  g  £/l2(C°’T]r) 

E  lj(f)  *  0  (2.3) 

E  ll(f)  ly(g)  -  ~r  (f»g)  2 

t  s,  sL 
t  1  r 

•  f  r 

-  j***j  lr(s1,---5r)g(s1,---,sr)dsr  •••ds1 

00  0 

_  r  r 

Therefore,  /rT  It  is  an  isometry  between  L  ([0,T]  )  and 

IyLL?(C0»T]r)].  Since  Al2([0,T]r)  is  dense  in  t2([ 0,T]r)  we  can 

extend  the  definition  of  iT  to  ([  0 ,T j r )  by  continuity. 


% 
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Proof  By  (2.1),  (2.3)  and  (2.4)  certainly  hold  for  r  =  1.  Let  f 
y  1  ( s ) . . . y r ( s r ) ,  9  *  a-|  ( s.| )  •  "ttyi s  ) ,  and  suppose  (2.3)  and  (2.4) 
hold  for  r  -  1 .  Then 


E  lj(f)  =  E 


t 

-  Y-j  (s)  I^'1  (y  •  **y  )ds  = 

Jq  1  s  2  r 

^t 

Y-| (s)  E  Ig  1  (yj* * *yr)ds  =  0 

J0 


I [(f)  i[(g) 


7-j  (s)ct-j  (s) 


Tr-1  f 
ls  v2 ' 


•  V  ) 


l[-]U 


sr- 1 

Q  Yl(sl}”*Yl(sr)al 


ar(sr)dsr---ds 


1  ‘ 


The  theorem  follows  by  induction  on 


Remarks  1  By  continuity,  (2.3)  and  (2.4)  hold  for  all  f  in 
L2([0,T]r). 

2.  It  is  not  necessary  to  require  that  f  be  symmetric 
since  integration  is  carried  out  only  over  the  set 
S1  it  s?  L  '  *  *  1  sr-  however,  the  convention  of  symmetry  is  usefu 


ar)ds 


later  on. 
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A  O  w% 

3.  For  f  £  C  [[0,T]  ],  let  f(s,...)  denote  the  section 
of  f  at  s.  We  want  to  write 

ft 

i:(f>  ■  !<(f(s,...))ds,  (2.5) 

J0 

V* 

but,  to  do  this,  we  need  a  measurable  version  of  I  (f(s, . . . ) ) .  If 

f  is  separable  on  s,  >_  1  l  sr  this  measurable  version  is 

immediately  guaranteed;  indeed,  this  is  how  we  defined  iT.  If  f 

is  not  separable,  let  (f  >  be  a  sequence  of  separable  functions 

such  that  !  I f n~f ! !  2  0«  We  then  see  that  lim  m.s  I^f^Cs,  •♦•)) 

L  n-*» 

is  a  measurable  version  of  I$  (f(s,«-*))  and  hence  (2.5)  is 

valid. 

00  _  2 

Let  G  G  =  l  (n,FT,P)  be  the  homogeneous  chaos 
r-0  r  1 

decomposition  of  { b ( s ) | s <T } ,  (see  Appendix  1  for  the  definition  of 
this  decomposition). 

Theorem  2.2  (I to) 

For  every  r,  Gf  =  {l£(f)|f  £  t2([0,T]r) }.  Thus,  if 
$  e  L2(n,FT,P) ,  there  exist  kernels  £/L2([0,Tlr)  such  that 
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Proof  The  full  proof  may  be  found  in  Ito  (20),  but  let  us  sketch 

briefly  why  it  is  true.  First,  note  that  the  spaces 

Vr  -  Ij(L^(Co,T]r) )  are  closed.  Moreover,  by  using  step  functions 

we  can  see  that  Vr  ?r  and  P^c.  Y  +-*-+Vr,  (VQ  =71),  (see 

appendix  1  for  the  definitions  of  ?  and  ?  ).  Since 

r  r 

Gf  =  Fp  €)  ?r_]>  it  suffices  to  show  V| _ L  Y  for  r  f  q,  that  is, 

that  integrals  of  different  order  are  orthogonal .  This  fact,  one 
of  the  salient  features  of  multiple  Wiener  integrals,  is  easily 
verified.  If  f  £  *£“( [C,T]r) ,  g  e  /L^([0,T]q),  r  >  q,  then,  from 
(2.5)  and  (2.1), 

T 

£  fyf)  I?(9)  =  |  E  I j"1  ( f(s ,...)) I^-"1  (g(s ,...) )ds 
0 


T  q-1  , 

r  r 

=  I  •••  E  I  L r ( s ] , 

;  sq  1 

0  0 


dV'dsl 


»  0  . 


Multiple  stochastic  integrals  generalize  easily  to  the 
vector  case.  If  b"^(t)  =  (b-j  (t) ,  •  •  • . b  (t) )  is  a  v-dimensional 

r  s  L  uO.i  J  ),  we  can  derme 


3rownian  motion  and  if 
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r 


r  :Z, 


If  (r,i. 


f  (q , k-,  *  *  *k  ) ,  then  the  integrals 

~r  q  j  t-i  j  •  •  *  > 

(f)  and  It  M(g)  are  orthogonal. 


In  the  case  of  separable  kernels,  a  construction  observed 
by  Brackett  (  3)  for  realizing  deterministic  Voltarra  series  can  be 
adapted  to  produce  multiple  stochastic  integrals  from  stochastic 
differential  equations.  This  result  motivates  the  use  of  multiple 
integrals  because  it  says  we  can  calculate,  or  at  least  approximate 
them  recursively.  Moreover,  the  criterion  of  kernel  separability 
is  used  in  Chapter  3  to  prove  finite  dimensional  computability  of 
certain  ootimal  filters. 


Iheore m  2.3.  Let  f  st^(C0,T]r) 


Ihen,  for  some  n,  there  exists 


an  lRn-valued  process  z(t)  that  satisfies 


dz(t)  =  l  A,(t)z(t)db  (t)  2(0)  =  z 


1=  1 


for  some  n  x  n  matrix  functions  A  (t),  l  =  v,  and  for  some 

r;2-i ,  •  •  • ,  l  j 

n-vector  function  c(t),  such  that  I„  (f)  =  c  (t)z(t),t<7. 


Proof  It  suffices  to  consider  f  =  y, (s,) — '„(s,J.  Sucpose 
-  »  i  r  r 


=  -  >  i  i  < 


'2 


<  i .  and  define  toe  ( r-M  )  * ;  r*!  1 


i 

matrix 


<  •  •  • 


f  vife(t) ,  (i,j)  =  '  i k  i 


0  ,  otherwise 


That  is: 


i ,  -  row 


*!<*)  ' 


■f  .-row 
J 


0  Vi  (t) 
1 1 


0  V  . 

J  v  -j  V  <-  I 

j 


Otherwise,  define  A  (t)  z  0.  Consider  the  system  z(t)  £  iR 


dz(t)  =  y  A  ( t ) z ( t ) dv  ( t ) ,  ZT(0)  =  (0, • • • ,0,1 )  . 
2=1  1 


We  have 


z„(t)  =  :  yr(s)db,  (s) 
;0  V 


IVry#*  ,i6*»-**  vC«*/ 
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Y-l 


z,(t)  = 


•ivw  "  W1^1'  r,'r'”^'’r'  ” 

r;V> 

Thus  I  1  r(f)  =  (1,0, •••,0}  z(t). 


(s, )  — '  (s  'db,  (sj***cb  (s.). 


Finally,  we  will  need  a  Fubini-type  lemma  on  the  inter¬ 
change  between  ds  and  db  integrations 


Lemma 


2.1  Let  f  s  ^"(CO,!]1"}.  For  t  <_  T 


t  t  r-2  t 

_ Ig"^ (f(s, • • *)ds  =  [  •••  [  [  f(u,s1,---,sr_1) 

0  0  "  0  'Sl 


du  db(sr_-j )  •  •  *db(s^ )  .  (2.6) 

t 

's. 


Proof  Define  9t(si ’  " ‘ ,sr-l ^  =  [  f ^u»si 5 ’ ' ’ »sr-l ^du'  The  r.h.s 
r-1  51 

(2.6)  is  It  (g^).  To  prove  the  lemma,  simply  verify  that 


E[ 


f  1 5~'*  (T(s,  •  •  •)  )ds  -  l[~](9tnZ  - 


by  using  the  basic  properties  of  the  multiple  stochastic  integral. 


2.2  The  Observation  Semi -Marti naale  Case 


For  purposes  of  filtering  we  must  define  multiple  stochastic 


intecral s 


& 


■Uni 
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t 

« 

0 


Y-l 


f(s1,---,sr)dy(sr)---dy(s1) 


(2.7) 


with  respect  to  observation  semi -martingales 


y(t)  » 


t 

* 

x(s)ds  +  w(t) 

0 


★ 


(2.8) 


(Recall,  from  definition  1.1  of  observation  semi -martingales,  that 

x(-)  and  w(-)  are  assumed  independent,  w  is  Brownian,  and 
T 


E 


x2(s)ds 


<  °°  for  some 


T,  0  <  T  <  «.-) 


Such  integrals  are  well 


j 

0 

known  and  are  developed  extensively  in  martingale  theory;  Meyer  [33] 
is  the  best  reference.  However,  the  structure  of  the  observation  semi 
martingale  case  allows  a  simple  construction,  which  we  develop 
here.  Begin  by  noticing  that,  as  stated  in  Theorem  1.1,  y( - )  is 
mutually  absolutely  continuous  w.r.t.  Brownian  motion;  if  P 
is  the  original  measure  under  which  the  processes  of  (2.8)  are 
defined,  there  exists  a  PQ  mutually  absolutely  continuous  w.r.t. 

P,  such  that  y(  )  is  Brownian  on  (n,F,P  )  for  t  <_  T.  Therefore, 
for  f  e  L  (10, T]  )  we  define  (2.7)  as  the  multiple  Wiener  integral 
of  the  previous  section  by  working  on  the  measure  space  (n,F,PQ)  and 
we  call  this  integral  It(f)  without  reference  to  measure. 


Remark  The  process  with  respect  to  which  multiple  integrals  are 
taken  will  always  be  clear  from  context  and  so  will  not  be 
indicated  in  the  notation  It(f). 

*  For  simplicity  of  notation,  we  have  set  h(s,x)=  x-  in  (2.8)  (see  section 
1.3).  The  results  to  follow  are  valid  for  general  h  satisfying  the 
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such  that 


For 

p{f 

J0 


--  adapted  processes  p(s,w) 

p 

ip  (s)ds  <  »}  =  1 ,  we  can  define  the  integral 


t 

<j»(s)dy(s) 

0 


t 

|  ?(s)xsds  + 
0 


t 

4>(s)dw(s) 

J 

0 


(2.9) 


(see  Liptser  and  Shiryayev  [28]).  As  with  the  Brownian  case,  (2.7) 
may  be  interpreted  as  an  iteration  of  (2.9) 


Lemma  2.2  Let  f  e  /L2(C0,T]r) 


Itr(f)  - 


t 

^  I^'1(f(s,-«-))dy(s) 


t  <  T. 


Proof:  This  result  is  an  easy  consequence  of  the  more  general  fact: 
t 

the  process  ;  ?(s)dy(s)  defined  in  (2.9)  is  stochastically  equiva- 
J0  ft 

lent  to  the  process  (  ;(s)dy(s))p  formed  by  working  on 

Jq  o 

(n,F,PQ)  where  y  is  Brownian.  The  equivalence  of  these  integrals 
is  obvious  for  stochastic  step  functions 


♦( 


U 

S,uj)  =  l  <f»(t .  »w)l  . 
1-1 


<s) 


and  it  follows  for  the  general  case  by  taking  limits  of  such 
step  functions. 


*  1 ,  f  f  ■]  (s)  =  indicator  function  of  (t.,t.  ,] 

' c  i » c  j  +  iJ  iii 
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The  expressions  (2.3)  and  (2.4)  for  the  mean  and  covariance 
of  multiple  integrals  no  longer  holds  in  the  semi -martingale  case. 
However,  it  is  important  in  later  calculations  to  evaluate  these 
moments,  and,  for  this,  the  next  lemma  is  useful. 


Lemma  2.3  Suppose  E[j  x2(s)ds]r  <  ». 
f  s  t2(C0,T]k)  0 

(D  Etlj(f)]2  lMk  ||f||22 


Then  for  k  <_  r  and 

;  <  »  is  independent  of 


* 


E  x(s1)---x(sk)dsk**-ds1  . 


Proof.  We  will  actually  prove  by  induction  the  more  general 
result:  for  r  _>  i  >_  k  z^,  •  •  •  e[0,T] 


E[x(a?)---x(ak+1)lJ  (fj2  <  h, ^ (ak+1 , • • • ,o,  )j  \f\  l2  (2.10) 


z,kv  k+1 


z-k< 


where  h^  k  e  L1  ([0,T]  ),  and 


%  si 


‘k-l 


EWoj)— x(ok+,)lJ  {f )J  *  {  |  —j  f(sr--.sk) 

K  ooo 


(2.11) 


E[x(s1 )  -  -  x ( s k )  (-rk+1 )  •  -x(c  ,}]dsk*  *ds. 
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Lemma  2.3  is  the  case  :  =  k  for  every  k  <_  First  we 
demonstrate  (2.10)  and  (2.111  for  r  >_  i  >  k  3  1 ,  using  the 
iterative  formula  of  lemma  2.2  and  the  independence  of  x(-)  and 
w(>).  Thus 


E[x(cri)-*x(cr2)  j  f($)dy(s)]2  =  E[x(crz)  •  *x(<j2) (f  f(s)x(s)ds 
0  0 

^  T 

J  f(s)dw(s)}]2£  [2E  j  [x(ai)-*x(s)]2ds  + 

0  0 

2  E[x(.„^.*(«2)2]]i!fi:* » 


(2.12! 


To  derive  the  inequality  in  (2.12),  the  Cauchy-Schwarz  inequality 

is  used  several  times,  h  ,  £  L  ([0,T]*~  )  for  i  <_r  because 
T  *•  >  » 

<•  n 

E[ !  x^(s)ds]r  <  *>.  Likewise 
J0 


•  j'l 

E[x(cl)  •  *x(o2)  J  f(s)cys]  »  E[x(<xa)**x(j2)[  f(s)x(s)ds  * 


(2.13) 


f (s)dw(s)]]  =  j  f(s)E[x(s)x(u2)**x(aA)]ds. 


Now  suppose  (2.10)  and  (2.11)  are  true  for  a  fixed  k 

and  all  i,  r  >  i  >  k.  Aaain,  using  I^(f)  =  f  I^(f(s,  •  • )  )dy(s) , 
-  -  ^  Jo  s 

Cauchy-Schwarz,  and  induction, 
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£[x< <f)]2  i 

z  c  ak+1 

*k+l  S-j 

2  |  j  E[x(crJl)**x(jk+2)x(s1)Ij  (f(s2>-,))]2ds2c!s1 
0  0  ’  2 

ak 

+  2  |  E[x(a£)..x(ak+2)I^(f(s,-*))]2ds 
0 


hz,k^s,ak+2’' ‘ ,az)ds  *  2hz-l,k( 


»crz)  ]  !  ;  f  ! 


,2Ik+l^<,k+2*”  ,cri^  !  • 


By  induction,  h^  ^  e  L^([0,T]Z  Thus  (2.10)  is  true  for 

k  +  1.  That  (2.10)  holds  for  k  also  implies 

T 

£  f  I*(f(s,-.))ds  <  -  . 

4 

0 

Thus,  because  of  (2.3) , 
t 

E  |  Ig(f(s,**) )dw(s)  =  0,  for  t  <_  T. 

0 

With  the  aid  of  this  equality  we  can  prove  that  (2.11)  also  is  true 
for  k  -  1 . 
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This  completes  the  induction  step.  Induction  stops  at  k  =  r  since 

we  have  required  r  _>  i  >_  '<  in  order  to  apply  E'1  x  os)r  <  ». 

-:0  s 


2.3  The  flu!  tipi  i cation  Formula 

-  ?  h 

To  any  given  functional  o  c  l  (a,F°,P)  of  a  Brownian 
motion  b(t),  t  <_  T,  one  can  associate  a  sequence  of  kernels 
{~r/r-Q>  kr  s  L“( [0,T]r) ,  such  that 


n- 1 

For  applications,  it  is  necessary  to  have  rules  for  manipulating 
this  representation,  rules  that  describe  how  the  kernels  '<n 
behave  under  various  transformations  of  the  ?'s.  This  section 
states,  proves  and  discusses  such  a  rule  for  the  simplest  case;  if 
f  s  L^([0,T]r),  g  s  ^([OjT]^) ,  what  are  the  kernels  such 

that 


The  answer  will  require  some  new  definitions. 


Definition  2.3 

(i)  ?r  =  projection  of  L  {[0,Tjr)  onto  L'fjO," 


% 
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(?rh)(~r-*.-r'  =  J.,  ^  h(J-{1)’"’-~T(r)' 


where  Sr  =  pemutatioa  group  on  r  letters. 


(ii )  For  integers  r,q,k,  0_<k<min(r,q),  and 
f  e  t2([0,T]r),  g  £/L2([0,T]q) 


/  f 

K 


k  ®  (t)g^)  (a.| ,  •  •  ,Jr+q-2k' 


u  U 


=  k,  [  (  f(s1  »•  •  >  V'l  ’* '  ,Jr-k^ 

'0  0 


g(sl  ’  *“  ’V°r-k+l  ’  ‘ ’  ’>+q-2k'  dS|'  "  'ds’ 


<  i 


(Hi)  fk  0(«)  9k!«i. ,2k)  s 


W2kCfk©Cw9k: 


i  1 


"1  ’  ’ ^  r+o-Zk ' 


(iv)  f  Qg  a  f  0(t)gn  = 


Pr+q^al  ’ '  *  ,cr^ar+ 1  ’ '  ’  ,crr+q^ " 


r+q' 


0(t)  is  the  basic  operation  by  which  new  kernels  a^e 

indeed,  we  will  shew  in  lema  2.4  on  at 

02,  r,  -’0N  -ir+G-2k, 

<  l  ;  -  L  _  .  o  oectS' 


0 ( t)  is  the 

created 

fi'cm  old,  ana, 

■„  .  ^2,  -r  -.r. 
,k-  -  •  - J > 1 J  . 
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stand  G(t).  it  is  useful  to  think  of  the  functions  f  and  g 
as  tensors,  which  they  in  fact  are  under  the  isomorphism 
L2([0,T]r)  =  L2([0,T])  (?)  • •  0L2([Q,T])  (r-fold).  Then 
f]<  (x)(t)gk  may  be  viewed  as  a  tensor  contraction,  and 
f ^  o (t)gk  as  a  symmetrized  tensor  contraction.  The  notation 
fkO  ( t ) gk  is  meant  to  recall  the  summation  notation,  i.e., 
a  sum  (integral)  is  taken  along  the  first  k  indices  of  f  and  g. 
It  is  in  this  definition  that  we  make  use  of  the  convention  that  the 
integrands  f  and  g  are  symmetric;  otherwise  t)  would  have 
a  much  more  complicated  definition.  Finally,  as  an  example  of  0, 
consider  the  case  r  >_  q  =  k.  3y  direct  computation  using  the 
symmetry  of  f, 


fqG(t)gg(=,.-.o  ) 


t  t 

^-1 I_  f  .. !  y  *rs  . .  s 

ir-qj!  q!  j  !  J  ' ,s 
0  0  r-q 


t  t 

i  r 


d  T(  1  )  ’ '  '  ’^(r-o)  'g'Sl  ’’  '  ’  sr 


x  as  • -as, 

q  1 


q!  j  *'  |  r^sl  ’  "  ,sq,al  '■  ‘  ,ar-q'9^sr”  ,sqjdsq'  ‘dsl 
0  0' 


The  main  theorem  of  this  section  is: 


‘heorem  2.-1  Let 


.?2,rp  -  .^2/r.  --0, 

-  L  j  1  j  ;  y  ^  -  u  x  * 
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min(r.q) 

i;<f)  lj(g)  ■  I  l^-2k  0(t)Sl 


k=0 


r-K 


i 


i] 


(2.14)  shall  be  referred  to  as  the  multipl ication  formula.  Our 
proof  of  (2.14)  uses  Ito's  differentiation  rule  and  induction,  and 
it  is  fairly  complicated  in  its  details.  Therefore,  before  embarking 
on  the  proof,  we  will  set  forth  the  relevant  properties  of  0 
in  some  preliminary  lemmas.  In  what  follows,  f  will  always  denote 
a  function  in  /L2([0,7]r),  g  a  function  in  L2([0,T]q). 


.ernma  2.4  For  every  t  <_  T 


f,  0(t) %  L2([0,lf+q-2k) 


[n  fact 


i=r,q,k  "«'!2 


where  c  .  is  independent  of  f  and  g. 
r  ,  q  ,  K 


Proof  It  suffices  to  prove  the  lenna  for  ®,  instead  of  G  , 
since  pr+q_2i<  1S  a  bounded  operator.  Let  do  =  do]  ’  ,cicr+q.2k 
ds  =  ds-j-'ds^.  we  then  have,  using  the  Cauchy=Schwarc 
inequal i ty 


::<  ©  ( t)g(< ' 


:o,t: 


'<:  ;2 


% 

L 
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*  C  ]  dsf{s1,--,sk,3l,..)g(sr--,s.K,--,,r+q_ 
CO,T]k 


(k! )' 


da 


f  —  2 

ds  i  ( s-j » •  •  >  Sj^  »o-|  >  •  •  ^ ) 


[0,T]r+q_2k  [0,T]k 


ds  g2(s1 


[0,7] 


,sk,Jr-k+r 


r+q-ck' 


(k!  )2 


Lemma  2.4  establishes  that  the  kernels  in  the  expansion 
2.14  are  square-integrable  and  hence  that  the  multiple  integrals 
are  well-defined.  The  next  lemma  collects  useful  identifies  and 
facts  about  Q  .  Recall  that  tne  notation  f(s. , ••  .s^ , • • )  in¬ 
dicates  the  section  of  f  in  which  the  first  !<  variables  are 
fixed  at  s^-.s^,  respectively. 


Lemma  2.5 


ff- 


<kG 


•< G 


.  ^  1 


:k.;G  C s ; g C s , 


<-  ; 


r-  i  r  j 


1 


,111;  rOT  <  >  i 
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S]^  V‘’*r+q-2k' 


£  r+q-2k  r(  Jr  ‘ ‘^kO  '  t';Sk  +  r+q-2k  rk  ®  ’“'k' 


^c2  ’ " " ’ Jr+q-2k^ 


(2.16) 


(Tv)  fO(t)5(ilt-,Vq)  =  f(c1  ,  •  •)  O  (t)  g 


+  fO  (t)g(jr  ••)]  (<72***’-r+q-2k'-  '2'17' 


?roo* 


(i)  follows  by  calculations  similar  to  the  proof  of 
lemma  2.4,  namely,  one  writes  out  the  definition 
of  the  square  norm  and  applies  Cauchy-Schwarq.  The 
details  will  not  presented. 

(ii)  3y  direct  calculation  and  definition,  using  the 
symmetry  of  f  and  g  extensively. 


fkO(t)gk 


?  L  —  ‘  * 

r+q-2kL  k!  j 


|  dS1..CSkf(Si,..,S.,..)g(Si  ,  ■ • ■ 3  k ,  •  •  ) 

o'' 


=  r> 


^q-2k 


*  s. 


3, 


k-1 


0-0 


•;  f:,sr  ..,sk,--;5is,,--,sk-vtsk 


:s. 
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3  3  3 

8  ?r+q-2!<[FT  !  "I  f  (sl  *  *  *  ’  3k  ’  '  ‘ }  g(  S1  ’  '  '  ’Sk ’ *  *  ;  dSk  ’  'dS  1  * 

'o  0  'o 


+  Pr+q-2!<^TFTy!  !  dS 


t  S1  Vl 

r  f  • 

t 


1  r(s1  , ,  ")g(s,  ♦•}dSi'*ds-|j 

0  :  o  1  K  i  .<  <i 


a  ^0(^)9^  +  !  ds  f(s,-*)k_i  G  (s)g(s,'-)k.i 

(iii)  and  (iv).  The  proofs  of  (iii)  and  (iv)  are  similar,  (iv) 
being  just  a  special  case  of  (iii).  Ne  shall  only  present  (iv) 
as  it  is  simpler.  Note  first  that,  by  definition, 


%[f(cr-.)0(t)g](,2>. 


r+q 


r  1 


r+q  ( r+q-1 ) 


tcS 


f( '1  ,ct(2)  ’  ’ *  ,3t( r ) ' 


r+o- 1 


g(cT(r+l),",CT(r+q)) 


'  ? 


where  t;  s  3  ,  ,  is  interpreted  as  a  oermutaticn  o: 

r+q-  I 

Now  using  the  symmetry  of  f,  (2.18)  may  be  written  as: 


12,*-,  r^c  ;• 


■  th 

j  position 


■J-1 


/  V*  ^ 


r+o- 
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Using  the  expression  analogous  to  (2.19)  for  f  G  (t}g(cr,  •  • ) 


-r+Q  >  '*)  ©  ( -)g  +  ’  O  (t)  .  ••} ;  (*2’  **  ’V-q' 


Tn+qT  •  ^  - 1  _J  f(cT(2)’*‘’V‘’S(r) 

J_1  T"Sr+q-l 


jth  position 

i 


) 


g(aT(r^l)’,'>JT{r+q) 


j=l 


L  f(aT(2)’“’JT (r+1)) 


r+q-1 


.th 


j~"  position 
9(a~(r+2)’*,>ff1’“’'7T(r+q)^ 


( r+q  )"i  I  1 )  ’  *  *  ')g^7t(rJ-l )  ’  *  *  ’  Jm(r+q)  ‘ 

^r+q 


=  f  O  (t)g(a1,-*,cr+q) 
’his  is  the  desired  result. 


Proof  of  theorem  2.4.  We  use  Ito’s  differentiation  formula  and  the 
preceding  lemmas  to  implement  an  induction  argument  that  proceeds 
in  two  steps: 

(a)  Show  (by  induction)  that  (2.14)  holes  for  orcers  r  =  n,  q  =  1, 
Vn 
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(b)  Assuming  (2.14)  for  (r-],q),  (r,q-1)  and  ( r-1 ,q-l } ,  show  that 
it  holds  for  (r, q) . 

(a)  and  (b)  then  provide  a  consistent  scheme  of  induction  as  shown 
by  the  following  diagram  demonstrating  the  paths  of  implication. 


s= 


IT 

3 

0 

■=> 

c  => 

it 

tr 

2 

c 

0 

c  ■=? 

0  f 

* 

IT 

* 

1 

1 

A  _.S  /A 

=t> 

0 

— > 

0  ^=r> 

r= 


Step  (a)  For  r  =  1,  q  =  1.  By  Ito's  differentiation  rule 


t  S1 


t  t 

f  f(s)db(s)  f  g(s)db(s)  =  f  f  Cf(s1)g(s?)+f(s-)g(s1)]db(s9)db(s1) 
In  Jg  J  nJ  r  1  ^  c  1  -  1 


h 


1 0J  0 
t 

f(s)g(s)ds 


Suppose  that  the  theorem  *■  s  true  for  (r,q)  =  (n-1,1)  and  let 
f  e  "L^CCO.TjH),  g  c'L^([0,Tj).  Applying  Ito's  differentiation 
rule  again, 
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j  g(s)  lj(fjdb(s) 


i  in_1  r f r .■•m1 


■s  (f(s,--))rs(3)db(s) 


is_1(g(s)f(s,.-))ds 


By  induction, 


Ig'Vis.'-mVs)  =  lJ(nCf(s,“)Og])  +  lJ'2(f(s,--)j  o  ( S ) C 


Lemma  2 - 5 ( i )  and  lemma  2.1  justify  interchanging  integrations 
in  the  last  term  of  (2.20): 


f  I3"1 (g(s)f (s, - * ) )ds  =  lj_1(f  g(u)f(u,s1,--,sn_1)du) 


rhus,  by  substitution  in  (2.20) 


il'(f)  i;(g)  =  j  Ci"(g(s)f(*.))  +  i''(n[f(s>..)Ggj)}db(s) 

t  w  jQ  s  s 


-  (  l5'2(f(s,--)1G(s)g1)db(s) 


-  I?"1  ( j  g(u)f(u,s1  •  ,sn_i  )cu) 
'S1 


(2.20) 


ro 
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is<f)i’'1(g(s,--)) 


min(r,k) 

r  rr+q-l-2!< 

I  As 

k=0  s 


(( 


r+q-1 -2k 
r-k 


0  (s)gk( s , * • ) k) 

(2.23) 


nnn(q-l  ,r-l  )+l  2k  ,  r*0_2k 
*  l  ir+q  2k  ((r+q  2k)f(Sj..; 

k*l  s  r'k 


k-1 


O  (s)g(s,-  •).  , ) 


k-1 
(2.24) 


Now  substitute  (2.22)  -  (2.24)  into  (2.21),  interchange  dt  and  db(t) 
integrations  where  necessary,  and  collect  like  order  terms.  The 
result,  after  some  nasty  calculation,  is,  if  q  <  r 


5  Ip  ;[(r^-')rf(s,,--)Og]  *  (r^-')[fOg(5,,--)]](s2,",s 


r+q 


q-1 


^  xt  q  2k{^  r-l-k  )  Cr(s]*")kO  (S1  )9(sl  »’ ' )  k-^(  s2  ’ ' '  ,sr+q-2k' 


+  (  ^  (s-|  )g(S| •  )k^s2’' '  ,Sr+q-2k^ 


r-k  ;  u'k 
t 

;  f(u,..)k.10(u)g(u,--),<_1du';  (2.25) 


Ir-q. 


t“H([T(s1 ,• -)a  0  (s1 )g  ](s2>* •  ,s  ) 
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,  O  (u)g(u, * •}  ,du>  . 


Now  examine  the  kernels  of  the  last  expression  one-by-one.  The  first 
kernel  equals 


O  C^(f(sr--)0  g)(s2,--,s  ) 


+  tJt  (f  Og(s1,"))(s2,--,s  ) 


=  OffOsKy-.s  )  . 


(2.27) 


The  last  equality  comes  from  lemma  2.5(iv).  Likewise  apply 

V-+ Q  _  p  u 

lemma  2.5(iii)  and  (iv)  to  the  kernel  of  u  •  ,  1  <  k  <  q  -  1 

U 


The  kernel  of  i[+cl-2k  equals 


(  r-k"  )  ^r+q-2k  ^S1 '  ^k  ^S1 '9k^s2’ "  ^ 


+  r+q-2k  ^fk G  ' S1 si »  "  ' kJ  ' S2 ’  "  ^ 


+  (j  (u)g(u,--)k.1bu)(s2,-‘)] 


s  (  r-k'"<  ■’ L '  T|<  ®  ^I'V'V”’ 


k''V  ’  Vq-2k; 


,  G  (u)g(  j  ,  •  • ; .  .  cu :  ’s«. 
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“  (^2k!t(fkO(t)gkKs,,--,sr+q.2k)] 


(2. 28) 


Finally,  in  the  same  way,  the  kernel  for  is 


(fq°  (t)9q)(sr*-,sr_q)  . 


(2.29) 


By  combining  (2.27)  -  (2.29)  in  (2.25)  we  complete  the  induction 
step  (for  q  <  r) : 


lj(f)l’(9) 


=  "it'q,'r2k(!^2k)fkG('^k! 


The  proof  for  q  *  r  is  the  same;  we  need  only  check  that  the  lowest 
order  contribution  in  (2.25)  corresponding  to  k  =  q  is 


t  S1 

'q©(t)V|  j 

0  0 


>1 

!  f(s1,---,sq)g(s1,-*-,sq)dsq---dsr 


The  multiplication  formula  relates  directly  to  properties 

of  Hermite  polynomials,'  as  one  naturally  suspects  from  the  connection 

between  homogeneous  chaos  and  multiple  Wiener  integrals.  In  fact, 

letting  { h n ( x )  1' n _ g  denote  the  Hermite  polynomials  defined  in 

Appendix  1  and  takinc  •'?  '■*  ,  to  be  a  comolete  oothcrc'-ial  oasis 

n  n- 1 


:o  be  a  comolete  ootncrcr'mal  oasis 


2  r  .  . 

of  L  ([0,7]  ),  oecall  from  theorem  2.2  arc  treorem  t.  ,  . 


44 


£t2(LO,T]r)}  =  Gr 


pairwise  unequal  >  .  (2.30) 

Now,  the  h  (x)  satisfy  the  identities 

min(r.q) 

•  jo  /([)(<)  /(r:^-2k)  hr+q.2k(x)  (2.31 ) 

for  r,q  >_  0  [see  e.g.,  Magnus  and  Oberhetti.nger  [26]).  Comparing 
(2.31)  to  (2.14),  one  thus  sees  via  (2.30)  that  the  mul tipi ication 
formula  effectively  generalizes  the  identities  (2.31).  There  is  a 
discrepancy  between  (2.31)  and  (2.14)  in  the  factors  multiplying  the 
expansion  terms,  but  this  is  due  to  the  different  normalizations  in¬ 
volved  in  the  definitions  of  h  ,  Ir  and  O 

n 

It  is  natural  to  ask  whether  theorem  2.d  can  be  proved 
using  (2.31).  However,  this  strategy  appears  exceedingly  difficult 
to  implement  and  I  have  not  succeeded  in  doing  so.  Recently  Hida  [13] 
obtained  a  proof  o£  the  multiplication  *ormula  independently  of 
myself.  Kis  proof  effectively  generalizes  the  techniques  used  to 
prove  (2.31),  but  to  do  so  he  must  invoke  his  advances  theory  oc 
generalized  3rownian  *unctiona;s.  Cur  croc*,  t.ocucn  'nvciveo 
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computationally ,  proceeds  via  elementary  methods. 

The  multiplication  formula  can  also  Pe  used  to  generalize 

a  fact  about  Gaussian  random  variables.  Suppose  Z  is  a  normally 

2 

distributed  r.v.  with  mean  0  and  variance  a  .  Then  the  well  known 
identity 


£  Z2™  =  ( 2m- 1 ) 


expresses  the  higher  order  moments  of  Z  in  terms  of  the  variance. 
Clearly  it  implies  that  if  ;<n  is  a  sequence  of  mean  0  normal 

0  ?m 

r.v.'s,  EX*  •*  3  as  n  -*■  »  iff  EX*-  -  0  as  n  •*  »  for  any 

t*  h 

integer  m  _>  1 ,  or,  in  other  words,  that  mean  square  and  2m  ' 
order  convergence  are  equivalent  for  any  given  m.  Now  I^(f)  is 

w 

f*  h 

an  element  in  the  mean-square  closure  of  r  order  polynomials  of 
a  Gaussian  process,  and  hence  itsmoment  convergence  properties  are 
similar. 

Theorem  1.5  For  any  r  and  k,  there  exists  an  M  ,  <  *>  such 

r ,  k 

that 

E  {(l[(f))2k>  <  Mr>k!;fM2k  (2.32 

for  all  f  s  t2([0,T]r). 

Proof  Assume  that,  for  a  given  n, 
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nr 


(ir(f))n=  7 1*00 

1  i=0  ‘ 


(2. 


where 


2n 


ik  |  |  <  M |  | f  j  |  ,  i  <_  nr 


(2.34) 


Using  the  multiplication  formula  there  exist  kernels  h  such 


that 


-  n+l  (n+l)r 
(l"(f))nTl  =  l  I*(h,} 
1  A« 0  z 


and,  from  lemma  2.4,  there  will  exist  an  N  such  that 


nr 


i  n  l  !!M 

j=o  J 


;  2 


!f 


,  .2 


<  N : 


,2(n+l) 


for  every  i.  Thus  (2.33)  and  (2.34)  will  hold  for  n  +  1  also,  and, 
since  che  case,  n  =  1  is  true,  they  will  hold  for  all  n  ty 
induction.  But  then 


EClt(-')j2n  -  fEClW-'  i  :  ik- 1 !' 
z=0  z  z=0 


nr 


<  M"  j If i |2n. 

Of  course  M"  will  depend  on  n  and  r. 

Corollary  Let  {f  and  f  be  functions  in  ‘L2([Q,7]r). 
Then  f  -f  -  0  as  n  -  »  iff 


fO 


47 


2k 


-  0 


as  n  -  ® 


for  any  or  al  1  k  _>  1 . 

Remark  Theorem  2.5  is  not  new.  I.  Segal  [43]  has  derived  (2.32) 
in  the  context  of  Gauss  measures  on  Hilbert  spaces.  In  fact,  he 
obtains  a  universal  constant  c  such  tnat 

E(Ij(fj)  £  2  j  ' f | !  for  all  k. 

Tneorem  2.5  is  also  related  to  tie  Nelson  33  theory  of  n_,oe^- 

contracti vi ty ;  see  .Hitter  and  Ocone  36  .  Neveu  73  'S  a  good 

reference  for  one  version  of  Nelson's  r.yoercontracti vi  t>  t.neore^. 

McKean  [32]  and  Wiener  [45]  also  develop  identities  for 
expressions  similar  to  E[(Iy(f))“  ]  in  theorem  2.5.  In  these 
treatments,  the  interesting  corollary  above  is  not  generally  observed. 

The  next  result  is  a  variant  of  an  easy  and  well -known 
identity  first  appearing  in  Ito  [EG],  and  stated  here  in  (2.39).  Gur 
proof  employs  the  multiplication  formula. 

7  h 

Theorem  2.6  Let  ?  e  Pj  have  the  multiple  integral  expansion 


TO 


A 


w 
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'hen 


,  3  ) 

r' 


)  = 


w  .  J  V  -  1  J 


■O'.  7 


Proof  For  given  ’q  >  "]  1  • • •  ► 


>  V  let 


yl(s'  =  0  <  i  i  r 


be  the  indicator  function  of  [0,c-].  We  will  first  show  that 

rr. 


b ( o- -j )  •  •  * b ( -  )  =  I-(y‘ — ^ )  +  7  II  ^n(n^ 
1  r  1  n=0 


i  r ' 


O  35' 


•an  ere 


[•j]  =  xax{n 1  n  <_j) 


and  each  kernel  h 


has  the  *orm 


*1 


V"cr  i  =  l  **1 


in  whicn  t.ne  g_ 
o*  t.ne  indices 
:*  :3l Cut  at ' n: 
be: cw. 


J1  "’Jr 


»  » 


- r  (2.26) 

functions  eacn  depend  only  on  a  proper  subset 
This  fact  will  creatlv  sirrolifv  the  task 


r  ,s  . 


r  • 


as  wi i .  oe  seen  ■  ■ 


0**  ^ 


'  '  n  (“  a 

:  •/ 


'  $  i\  ' : 


car.  :*?  ^rovec 


VO 


Then 


49 


b(0---b(0  =  il(-/,)i;(v1...yr) 
u  r  i  . 


[r/2] 


Now  expand  all  the  products  in  this  sum  by  the  multiplication  formula. 
It  is  easily  seen  that  the  kernels  of  any  multiple  integrals  that  arise 
from  expanding  the  terms 


il(v°)i:-2n(hn 


will  be  of  the  form  (2.35).  However,  the  first  product  is 


T  1  f  0  \  T  f  t  1 

IT( Y  ) I T  C  Y 


r 

*Y 


\ 

J 


lj+]Ur+])y°0  ( T ) y 1 


rr)  x  Ij  ,((y°)1  O  (T)  (■ 


r  v 


"he  first  kernel  is 


1 


r 

r  0  f 

L  y 


for  sn  >  s, * ■  •>  s  . 
0—1  -  r 


1 


3ut  note  that  if  j  _>  0,  yu(s.)v,  'sn;=v'(sn!  *cr  sP  >  s  . ,  sine 
>_  s^  implies  _>  5i; 
written  more  simolv  as 


J' ■  VJ0'  -  -G  -  -J 

~hus  the  last  expression  nav  oe 


4 
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7ZT  yO(s0)"'y'  (V'  *  riT  y1(s0' 


S.:  1  )-^~1  (s,_,  - vr(s  ) 


r' 
(2.37 


and  only  the  first  term  depends  on  all  the  indices  ~r.,z-,‘  . 

W  l  I 

Likewise,  the  second  kernel  is  seen  to  be 


r1(s)Y2(s1)...vr(sr.1y 


(2.23) 


The  only  kernel  in  (2.37)  and  (2.23)  that  depends  on  every  index  is 


Thus  by  substituting  the  results  of  these  kernel  computations  into 
the  expansion  of  b(jg) • *b( a^) ,  we  find  that  (2.35)  is  true  for 
r  +  1  as  well.  Thus  (2.35)  holds  for  all  r  by  induction. 

As  a  result  of  (2.35) 


,r 

d 


5?  b(  7-j ) 


•b(: 


r-  Tr,  1 
7?  i-(v 


[r/2] 

n=0 


/,  n 

,  n 


t  S1  3r-l 


3a1  •  •  *3or  i  J  __  J  r  1 


UST---.SJ1 


0  0  0 


[0,arJ(sr)  dsr-’-ds. 


(2.39 


=  !<r(c1  ,  •  •  •  ,Jr)  • 


This  completes  the  proof. 


Finally,  we  note  that  theorem  2.4  extends  easily  to  the 
case  of  multiple  integrals  with  respect  to  observation  semi¬ 
martingales.  Under  added  assumptions,  theorem  2.5  and  its  corollary 
extends  also.  Indeed,  let  I-r  (f)  now  denote  integrals  with  respec 
to  y  and  let  be  the  measure  w.r.t.  which  y(*)  is  Brownian. 


Theorem 


d  P  ' 

2. 7  If  EQ  (-jjp— ;  <  «,  theorem  2.5  and  its  corollary  hole 


Proof  Use  the  Cauchy-Schwarz  inequality  to  derive 


7fir  lr  (-V.  <  c^/2  p — dP i  -1/2  r.r 

“LIT,y'!)J  c0  'dPn  *T ,y  ' '  '  —  u0  LdPnJ  “O  L‘T,yl')j 


1/2  ,dP_,%  ,..-,2k 

-  t0  dPQ  1  r,k  1 ‘ 1 1  ' 


H3 
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Remark 
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T 
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CHAPTER  3  ESTIMATION  OF  NONLINEAR  FUNCTIONS  OF  GAUSSIAN 
PROCESSES 

In  this  chapter,  we  begin  the  application  of  multiple 
integral  expansions  by  treating  a  filtering  problem  considered  by 
S.  Marcus  in  his  thesis  [29  ],  (see  also  Marcus  and  Wills ky  [31  ]). 
Marcus  succeeding  in  constructing  a  class  of  filtering  models  which 
are  interesting  for  systems  applications  and  for  which  optimal 
filters  can  be  finite  in  dimensionally  and  recursively  computed, 
(indeed,  a  rare  and  happy  event!) .  Roughly  speaking,  these  models 
pass  the  outputs  x(t)  of  linear  stochastic  systems  through  poly¬ 
nomial  nonlinearities  and  seek  to  estimate  the  result  based  on 
linear  observations  of  x(t)  in  white  noise. 

Marcus's  original  proofs  accordingly  rely  upon  linear 
filtering  theory  and  Gaussian  moment  identities,  and  so  his 
techniques  never  really  leave  the  realm  of  Gaussian  process  theory. 
One  .naturally  suspects  that  the  proper  framework  for  his  problem 
is  homogeneous  chaos  theory,  the  theory  of  polynomials  of  Gaussian 
processes.  In  what  follows,  we  will  show  this  suspicion  to  be 
well  founded  by  developing  a  direct  proof  of  Marcus's  results  with 
multiple  Wiener  integral  techniques.  We  feel  this  proof  explains 
in  the  clearest  manner  why  finite  dimensional  filters  occur  i.n 
this  problem  anc  hew  the  rilter  statistics  arise.  "The  resul  ts  to 
to  be  discussed  here  have  aooeared  in  Marcus,  Mister,  Gccne  *33  [, 


JU. 


r-<- 
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where  an  extension  by  Marcus  to  the  discrete  time  case  is  also 
presented.  It  should  be  noted,  as  an  aside,  that  solving  the  dis¬ 
crete  time  case  requires  using  polynomials  of  the  innovations, 

(see  Marcus,  et.  al  .  [30]).  This  feature,  which  does  not  occur  for 
continuous  time,  is  explained  by  homogeneous  chaos  theory.  We  will 
not  comment  on  the  discrete  case  any  further.  Finally,  Hida  and 
Kallianpur  [19]  solve  the  related  problem  of  predicting  polynomials 
of  a  Gaussian  process  using  noiseless  observations,  and  they  also 
use  multiple  integral  techniques.  3y  contrast,  the  results  here 
deal  with  the  case  of  noisy  observations. 

A  brief  exposition  of  the  homogeneous  chaos  theory  relevant 
to  this  chapter  is  presented  in  Appendix  1. 

The  problem  may  be  stated  as  follows.  Consider  the  linear 

system : 


dx(t) 

=  F(t)x(t)dt  + 

G(t)dw(t)  x(0)=:<g 

(3.1a 

dy(t) 

=  H(t)x(t)dt  + 

dn(t)  y(0) =0 

(3.1b 

In  (3.1):  x(t)  e  IRn,  y(t)  ;IRn;  F,  G,  and  H  are  piecewise  con¬ 
tinuous,  bounded  matrix  valued  functions;  w(t)  and  -,(t)  are  inde¬ 
pendent,  vector  Brownian  motions;  and  xr  is  a  Gaussian  random  vari- 
able  independent  of  both  w(>)  and  •;(•).  We  consider  x(c;  to  be  the 
signal  process,  y(t)  the  observation  process,  and  we  are  interested 
in  calculating  the  filter  f„  =  3{ f_( x(s ) ,s<  t '  for  functionals 
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ft  which  are  finite  Vol  terra  series  with  separable  kernels  in  the 
elements  of  x(t),  (see  definition  3.1). 

In  2.1,  we  gave  a  construction  for  realizing  a  multiple 
stochastic  integral  with  a  separable  kernel  as  the  output  of  a 
stochastic  differential  system.  8y  the  same  construction ,  we  can 
produce  ft,  i.e.,  there  exist  matrices  A  (t),  z=Vyn  and  a  vector 
c(t)  such  that 

ft  =  cT(t)z(t)  (3.2) 

where 

-  Cl  Az(t)xz(t)  ]  z(t)  z(0)»0  (3.3) 

z*  1 

Figure  1  illustrates  the  situation: 


Fig.  1 
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The  following  definitions  are  convenient: 


Definition  3.1 


,  =  {  V 

p  i=l 


fGp-l  . 

V  P; 


V  ,(ol)-*xk.  n(aP)d<V'd::1  i  £  Lloc(  *} 

1,1  Ip 


(where  aq  =  {yQU)\  yq  s  L^oc(  3R)>) 

Definition  3,2  The  filter  ftsE{r*t(x(s)  ,s<t) '  Fp  is  finite 

dimensionally  computable  (FDC),  if  it  can  be  computed  from  the 
output  of  a  finite  dimensional  stochastic  differential  equation 
driven  by  y(t) . 

Marcus  [ 29  ]  proved  : 

Theorem  3,1  For  any  p,  if  f^  s  a  ,  f ^  is  finite-dimensiona 

computabl e. 


Remark  The  theorem  remains  true  if  dy(t, 

R(t)dw(t}  where  R(t)>0  and  is  deterministic. 


x  ( t )  d  t 
roof  is 
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adaption  of  the  one  to  follow. 

To  carry  out  the  proof,  we  need  some  properties  of  the  state 
estimator  x(t)  =  E[x(t)|F^]. 

Lemma  3.1  (Kalman-Bucy) 

1) 

dx(t)  =  F(t)x(t)dt  +  P(t)H^(t)dv(t)  ,  x(0)  =  xQ 

where 

ft 

v(t)  =  y(t)  -  H(s)x(s)ds 
•  0 

and  P(t)  =  E[(x(t)-x(t))  (x(t)-x(t) )T]  satisfies 

P{t)  =  F ( t ) P ( t )  +  P( t) FT( t)  +  G(t)  -  P(t)HT(t)H(t)P(t) 

P(0)  =  cov(Xq) 

ii)  'j(t)  is  a  Brownian  motion  and  (up  to  sets  of  measure 

zero)  for  all  t. 

Proof  See,  for  instance,  M.H.A.  Davis  [  9,10  ].  The  process 
v(t)  defined  in  lemma  3.1  is  called  the  process,  and 


-  58  - 


It  is  the  key  to  our  proof,  because  it  is  a  Brownian  motion  that 
captures  the  information  in  y(-).  That  is,  if  f  is  a  random 
variable  such  that  Ef  <®,  then,  by  lemma  3.1  ii) 

E{f|F^}  =  E{f|Fp  and,  hence,  by  the  homogeneous  chaos  expansion, 
we  can  write 

E(f|F*}  =  k  (t)  +  7  C  k](t,o)dv.(a) 

1  0  i=l  JQ  !  1 


n 

+  I 

i  ,J=1 


k2'J(t,°l  »J2^dvi  (°2^v^3l 

J 


+  •  •  (3.4) 

By  using  the  innovations  process,  we  thus  achieve  an  orthogonal  de¬ 
composition  of  any  filter.  But  if  f  e  a  ,  we  can  go  much  further. 
Lemma  3,2  If  f^  ,  the  expansion  (3.4;  truncates  at  orde1"  p: 

n  ft 

f,  =  k  (t)  +  •  kj  (t,3)  d  ’.(:)  +  •  • 

10  i=l  Jq  1  1 

n 

+  l 

Z1 

dv  (  ;  )  •  *dv  .  (  3,  )  (3.5) 

*p  p  -1  1 


f°p-l  i. 

"I  k 
J  n  P 


p(t,a 


1  • 


V 
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2 

Proof:  From  the  definitions  of  x(t)  and  .\p»  £**£<=  and  thus  (2.  A) 

T  r  -r  r 

is  valid.  Now  consider  the  process  z'(t)  =  (x  (t) ,x  (t) ,■/  (t) ) . 

3 y  (3.1 )  and  lemma  1  i ) 


dz(t)  = 


F(t) 

H(t) 

H(t) 


.1 


P(t)-P(t)H  (t)H( t)  0 

-H(t)  0 


t)dt 


G(t) 

0 

0 


0 

P(t)HT(t) 

I 


1 


dw(  t) 
dr,(t) 


J  L 


z(o)T=  (xj,  xj,  0) 


Thus  z(t)  is  generated  from  a  linear  system  with  Gaussian  inpus  and 
Gaussian  initial  value  and  hence  is  Gaussian.  We  conclude  that 
(x  (t),  v  (t))  is  Gaussian  also,  and  hence  that  we  may  apply  tne 
homogeneous  chaos  construction  to  it.  In  Appendix  1,  this 
construction  is  developed  for  scalar  processes,  but  it  easy  to  see 
that  the  entire  theory  remains  valid  if  we  reolace  of  Appendix  1 
by 


H  =  S 


oan 


<  ;  (  3 


1, 


-  SO  - 


Letti ng  A  =  j{x . ( s) ,  (s)  |  1  <_  i  £  m,  1  <_  j  <_  n ,  s  <_  t} ,  we  can 

build  from  H  the  homogeneous  chaos  decomposition 


L 2(n,  A,  P)  =©  G 
1=0 


However,  we  can  also  perform  the  homogeneous  chaos  decomposition  on 
the  process  v(*)»  that  is,  if  G  (u)  denotes  the  i'*1  homogeneous 
chaos  of  v(s),  s  <  t, 


L2(a,  f^,  p)  =  0  gTTuT 

t  i=0 

-  p 
Evidently,  G  (v)  C.  G  for  each  i  and  ff  s  0 

z  z  “  z=0 

is  a  polynomial  in  x(-)  of  order  p.  Since 
l  P 

and  G,  -*-  ©  G,  for  k  >  p, 

K  i=0  * 


f  ^  -1-  G^(v),  l  >  p . 


But 


E{ft|Fi}  =  projection  of  ft  onto  0  G  (v) 
1  Z=0  1 

?  _ 

Thus  E\f*;F.;;  ©  G  (j)  as  desired. 

*  c  z=0  1 


since  ft 
G^  (v  )  c  G, 


We  shall  also  need 
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Lemma  3 . 3  Let  z  -  C^i » *  *  *  »Z|CJ  be  a  jointly  Gaussian  random  vector. 

k 

(1)  E[z,---zJ  ■  Ez,  Ez?*--z.  +  2  cov  r 2 7 , z •  ]  E[  n  2,] 

IK  c.  K  j=2  '  J  Z 

(ii)  E[z, •••!.]  «  Ez, • • *Ez.  +  \  cov(z.  ,z.  )Ez,  •••Ez, 

K  x  -O'!  >0  2  ^3  ^  k 

+  l  cov ( z  ■  }z  .  ) cov ( z  •  ,z.  )Ez  •  •••Ez, 

J1  -2  J3  J4  J5  -k 

where  the  sums  are  taken  over  all  combinations  of  pairs  in  1  •  -  - k > 

Proof  These  types  of  results  are  well -known-,  a  good  reference  is 
K.S.  Miller  [34],  The  particular  form  used  here  is  that  given  in 
Marcus  and  Will  sky  [31], 

Recall  again  that  when  a  multiple  Wiener  integral  has 
a  separable  kernel  it  is  fini te-dimensionally  computable.  Therefore, 

l  *  •  •  I 

by  lemma  2,  f^  will  be  FQC  if  the  kernels  kj.  r(t , j-j  ,  •  •  •  ,?r) , 
r  <_  p,  of  (3.5)  are  separable.  Proving  separability  is  thus  the 
strategy  of  the  proof  of  theorem  1.  We  shall  need  one  mere  lemma 
that  is  a  standard  fact  about  linear  stochastic  differential 
equations. 
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\  n 

,emna  3.^  Let  I,t;  Oe  tne  valued  solution  c-  a  linear  system 
with  3rownian  inputs. 

dZ(t)  =  A^t)Z(t)  dt  +  3(t)dW(t)  . 

Here  w'(t)  is  a  vector  3rownian  motion  and  A(t)  and  3(t)  are 
piecewise  continuous  matrices  of  appropriate  dimension.  Then 


Cr7 

-L4* 


( t)-EI( 


)][Z(s 


1 

i,  3  /  „ 


( t  <  S  ; 


/■+. 


,s)  +  1 


{ t  >  S  ; 


x2ct ,  s : 


where  1 ..  ,  denotes  the  indicator  function  of  {t<s>  and 

•„t  <S; 

X| ( t , s )  and  X9(t,s)  are  matrices  of  separable  functions. 


Proof  Let  }(t,s)  denote  the  state  transition  matrix  or  A(t),  tha 
is  ^  J(t,s)  =  A(t)j(t,s),  -;(s,s)  =  1.  Let 


K(s)  «  Cov[Z(s) ]  . 

One  easily  calculates  from  Z(t)  = 
that 


Tz(s1-E:(s; 


i  ( t ,  s ) Z { s )  +  .  J ( t , u ) S ( u ) dw ( u ) 

*  C 


5(t,0)r1(s,0)K(s)  s<: 


(s)[}(s,C)}  -{t.O) ]  S  > 


E[Z(t)-EZ(t) 
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3roof  c f  Theorem  1*  We  can  assume  that  nas  the  form 

- - - - — — -  L 


t  “P-1 


v0( «r,  («, )  •  •  *rp{ *p>  V  '  V  VV  •  **1  '3'5) 


0  0 


Then  has  a  finite  expansion  as  in  (3.5).  The  proof  will  consis 

of  showing  that  the  kernels  kr  r(t,o^ , • • • ,ar) >r<p,  of  the 

expansion  (3.5)  are  separable.  The  idea  is  to  use  theorem  2.6  to 
^1 "  *  *  ^r 

express  k  in  terms  of  rt  and  v.: 

r  l  t 


Z  , . ,  Z  />  r 

kr  (t  ,0] , . , .  ,cf)  5c  ;c  ^1  *'  •  •  ^rr^ 

I  r  1  r 


-  - _  C  r  f  .  '  -  \  ,  ,  ( r,  \  1  17  7' 

5C r--3cr  *  T  1  r  r 

for  t  >_  c^  >  a2  >  •••’  >  -r  •  The  second  equality  in  (3.7)  is  de¬ 
rived  from  the  fact  that  v(c)  is  F Y  -measurable  for  r  <  t  . 

w  ~~ 

When  the  expression  (3.6)  for  f„  is  used  in  (3.7),  the  result  is: 

it . . 

<r  (t  j-’i  ,  •  .^r; 


r0Vv  ^  '  ,yoVjo'5< 


0  Jo  u  "P  roav..3cv 


K*r  (*,).. -V  } 

I  0  I  ] 1 


dsp,..dSl  (3. 

'The  ioea  of  this  oroof  is  cue  to  S.  ,,!arcus.  It  aooears  in  Marcus 
Mitter,  and  Ccone  ”30]  but  with  some  er-ors.  The  errors  are 
corrected  here. 


A.  .wi 


Since  (x(t)  ,v(t)_ )  is  a  Gaussian  o  recess,  ’emma  3 . 3  i  i )  nay  be 
apolied  to  the  expectation  tern  in  the  integrand  of  (3,3).  The 
result  is  that  this  tern  may  be  written  as  a  sum  of  products  of  the 
following  terms 

Ex,  (s.)  ,  c o v [ x ,  (s.),x.  (s.)], 

Ki  Ki  1  Kj  - 

137  c°v[xk  (3  ), 

a  ’  j 

,2 

^  )  >V- 

3^  ..  3C:  <-,•  1  i  J 


Because  v(-)  is  a  Brownian  motion,  the  last  two  terms  are 
identically  zero.  The  first  two  terms  are  separable  functions  of 
(Sp . . .  ,sQ)  on  the  range  of  integration  s]  ^  s^  1  5n  ;  *c 


(Sp. 

•  •  » 

so} 

on 

the 

range 

thi  s 

for 

cov[x,K 

•  (3i 

l 

J 

si  - 

s . 

J 

i  f 

i  < 

j 

on  th 

ma i n i n  g 

term 

is 

CO 

3C 

v[x 

■V 

r)3 

3 

3a 

* 

(Note:  In  (3,9)  some  of  the  subscripts  have  been  drooped  *'cr 
simplicity  of  notation.)  Since  the  observation  noise  ~( • )  is 
indeoendent  of  the  sional  :<(•)  .  covT x  's'  -  ' ^  ^ 

(3.3)  =  co'/fx  _V 


(3.9) 


/  > 


'".US  , 
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system  with  Brownian  inputs.  This  system  may  be  explicitly  con¬ 
structed  using  the  :<alman-3ucy  result  in  lemma  3.1.  Thus,  using 


lemma  3,4, 

cov[xr(s }  ,[H(c)  (x (?)-:< (o) )  ]^]  =  } < 


> 


^2  v  s , c ) 


=  ]{s  <  ff}C^Cs,c)o2Cs,a}] 

+  P,,(s  ,c)  (3.11) 


where  p,  and  are  separable  functions. 


The  end  result  of  all  these  remarks  is  that  (3.3)  may  be 


written  as  a  sum  of  expressions  of  the  form 
s. 


rt 

I  . 
JQ 


P-1 


JO  0 


Yn(t)...-p(Sp)ai(s1)...uD(sp)£(ri;...£(Or) 


x1{s,  <  0,  }-'J{s.  <  c.  }  ds  . . . ds, 

^1  “  M  Jq  “  "a  p  1 


such  that  q  <  r 


and 


M . 

w  i 


•Jp>  c  --1 


.3: 


To  complete  the  proof  it  is  only  necessary  to  show  that  (3.12)  is 
separable  as  a  function  of  (t,o-j , . . .  ,o  )  ,  However,  by  appropriately 
adjusting  limits  of  integration  in  (3.12)  we  can  write  (3.12)  in  turn 
as  a  sum  of  terms  of  the  sort 


CO 
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Since  (x(t),'v(t))  is  a  Gaussian  process,  lemma  3 . 3  i  i }  may  be 
apolied  to  the  expectation  term  in  the  integrand  of  (3,3).  The 
result  is  that  this  term  may  be  written  as  a  sum  o*  products  of  the 
following  terms 


E*k .  (^ i )  ’  covtxk  Cs^ )  ,-xk  (s  ^  )1 , 


*  4  V 
J 


a  ’  J 

,2 

— - -  covfv  r~  '  •  (-  V 

a  ^  ^ aC ^  *-  .  I  k  i  j 

i  J  1  j 


and 


3*1 


3ecause  v(-)  is  a  Brownian  motion,  the  last  two  terms  are 
identically  zero.  The  first  two  terms  are  separable  functions  of 


(s-p.,.,sQ)  on  the  range  of  integration  s-j  _>  _> 


>  3 


to  see 


this  for  cov(x,.  (s-),x.  (s.)]  use  lemma  3.4  and  the  fact  that 

S'  1  Kj  J 

h  6  r  9  - 


s,  >  s.  if  i  <  j  on  the  ranee  s,  >  s-  >  ...  >  s 
1  ~  J  i  —  Z  —  —  p 

maining  term  is 

■£;  cov[xk(s),'Jz(-)]  =  ~  cov[xk(s},  j  C H ( s ) (x(s)-x(s) ) j^ds 


(3.9) 


(Note:  In  (3.9)  some  of  the  subscripts  have  been  drooped  £o r 
simplicity  of  nocation. )  Since  the  observation  noise  -(•;  is 
indeoendent  of  the  signal  ;<(•)  ,  ccv[x  (s )  ,r  ( r )  ]  a  C  .  "hus , 


(3.9)  =  C37[x^s:,[H(r;rx(r:-x(r;):;:  (ii 

x  (t; ,x‘ (z) ]  is  one  solution  o-  a  1  inear  scochasti. 
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such  that,  for  each  1  ,  ai-c{: ,s. ,  . . jp>  ,  M'^s j_i  >~i  >  •  •  •>  V  , 
a,  1  a?  1  1  a0  >  sj  1  a-ji]  ,1  ±  p  .  Note  that  [3.13)  contains  no 

indicator  functions  1 .  .  .  Using  the  identity 

fb  cb  ra 

g(s)ds  =  g(s)ds  -  '  g(s)ds 
J  a  Jo  <'0 

to  write  single  integrals  as  separable  functions  of  their  upper  and 
lower  limits,  it  is  seen  that  (3.13)  will  be  a  separable  function  of 
(t,a-|..."  )  .  Thus  (3.12)  and  (3.3)  will  also  be  separable,  since 
they  are  ultimately  sums  of  terms  like  (3.13),  It  is  worthwhile 
illustrating  the  last  argument  with  an  example. 


Consider 


!  |  a1  (s1  )e2(s2)»1  )32^2)  1,-  <  -  ‘{s  <  -  }ds2asl 

0  0  .-12-2 


By  straightforward  calculation,  this  equals 


c2  S1 


f 


1  “ 


3-j  ('^"j  *  j  al [S]  )a2^  s2^s2^s1  ~  j  j 

0  0  a?  0 


o2  S1 


=  v 


2, (s, )^?(s2)dSods1 


0  '0 


J2  '1  "2 
0  S2(s2}ds2  [;0S1:S1}^  "  .a"  S' 


wmcn  '3  separate. 
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CHAPTER  4  MULTIPLE  STOCHASTIC  INTEGRALS  A. 'JO  NONLINEAR  FILTERING 

This  chapter  applies  the  viewpoint  of  multiple  integral 
expansions  to  the  general  filtering  problem  stated  in  the  introduction . 
First,  the  Kail ianpur-Striebel  formula  is  used  in  4.1  to  derive 
a  representation  for  the  optimal  filter  as  a  ratio  of  two  multiple 
integral  series.  The  integrals  in  this  representation  are  formed 
with  respect  to  the  observation  process  and  have  kernels  that  depend 
only  upon  the  unconditioned  distribution  of  the  signal  process  and 
that  hence  may  be  computed  offline,  prior  to  receiving  any 
observations.  Secondly,  we  discuss  the  class  of  suboptimal  filters 
consisting  of  a  multiple  integral  expansion  truncated  after  a  finite 
number  of  terms.  By  combining  the  exact  filter  expansions ,  the 
multiplication  formula,  and  change  of  measure,  we  derive  kernel 

f*  h 

equations  for  the  kernels  of  the  best  r  order  filter  of  this  class. 

We  then  treat  the  cases  r  =  1  and  r  =  2  as  examples  and,  using 
the  same  techniques,  rederive  the  Kalman  filter. 

The  filter  expansion  presented  in  4.1  resembles  formulae 
obtained  by  Eterno  [11]  in  his  thesis.  Eterno  built  filter 
approximations  by  expanding  the  unnormalized  conditional  density  in 
moment  or  cumulant  power  series,  and  his  expressions,  when  aoc-rc- 
oriately  evaluated,  have  multiple  integral  interpretations . 

Cur  expansion,  which  can  also  be  apDliec  to  one  conditional  tensity, 
is  constructed  along  different  lines  and  we  acol j  it  to  a  o''*=rent 


■A*v 
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class  of  suboptimal  filter  designs. 

It  is  worthwhile  empnasizing  that  the  stochastic  integrals 
employed  in  this  section  are  taken  with  respect  to  the  observation 
process,  and  not,  as  in  Chapter  3,  with  respect  to  the  innovations 
process.  At  first,  integration  against  innovations  appears  to  be 
an  attractive  alternative,  because  the  innovations  are  3rownian  and 
so  allow  one  to  exploit  the  homogeneous  chaos  theory,  theorem  2.6 
for  computing  kernels,  and  etc.  in  approximating  filters.  However, 
in  constrast  with  Marcus'  problem,  the  innovations  are  not  easily 
calculated  for  they  require  optimally  estimating  the  signal  n(xt) 
(see  the  introduction),  a  problem  of  equivalent  difficulty  to  the 
original  one  of  estimating  an  arbitrary  functional  f(x  ,s<t). 

Integrals  against  the  observation  process,  on  the  other  hand,  are  more 
readily  computable,  but  less  easy  to  handle,  since  y(-)  is  not  in 
general  Brownian,  much  less  even  Gaussian.  y(-)-based  integrals  of 
different  orders  are,  for  example,  not  orthogonal,  making  it  difficult 
to  project  random  variables  on  finite  order  sums  of  integrals.  The 
technique  introduced  below  to  derive  kernel  equations  for  best  finite 
order  estimates  addresses  precisely  this  complication  and  provides 
tools  for  exploring  the  probabilistic  structure  of  multiple  integrals 
of  y(t)  in  more  detail.  Thus  integrals  of  v(t)  can  be  analyzed 
and  are  more  satisfactory  for  acplicazions  than  integrals  of  the 
innovations  process. 


sm*  r 
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4.1  Filter  Expansions 


To  fix  notation,  let  as  restate  the  general  "Uterine 
problem  and  the  Kail ianpur-Striebel  formula  for  the  optimal  estimate. 
(y(t)  |0  <_  t  <_  I},  (x(t)  [0  <  t  <_  T}  and  (w(t)  jO  <_  t  <_  T}  are  scalar 
valued  processes  on  the  probability  space  (n,F,P),  and  h(s,x)  is  a 
real-valued  (3orel)  function  such  that 


y(t)  =  |  h(s,x(s))ds  +  w(t)  t  <_  T, 

0 

T 

E  { f  h2(s,x(s) )ds}  <  =  , 


(4.1) 


and  w( • )  is  standard  Brownian,  independent  of  x(.). 


Let  =  F*  v  and  define  Pg  by 

dPn  f‘  i  ? 

—  =  exp[-j  h(s,x(s))dw(s)  -  jj  h  ( s  ,x( s )  )cs]  . 

0  0 

Recall  that  Pg  is  a  probability  measure  w.r.t.  wnicn  y(-)  and  x(-) 
are  independent,  x(-)  has  the  same  law  as  under  P,  and 
(y(t),G.)„  _  is  a  Wiener  orocess.  3y  this  last  statement,  we  mean 
that  w.r.t.  Pg.  y(t;  is  a  continuous  G*. -marts ngal e  suc.n  t.nat 
Hg-:[y(t)-y(s;:2  Gs}  =  t  -  s:  in  general,  -{  ^  Gr .  Let  *.'x,,s_t; 
be  a  “ini  t  e  variance  ncn-anticipative  functional  o;  rcr 

simolicity,  we  snail  in  the  sequel  always  senote  <  s_t  :y  r  t  . 
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and  likewise  h(s,x(s))  by  h(s).  Then  the  Kail ianpur-Striebel  ferrule 
states 

z'J' 


h  '=  E(f(t)|Fp  t 

EQ{f (t)exp[|  h(s)dy(s)  -  j  j  h2(s)ds] ! Fp 


(4.2) 


t  t 

Eq {exp [ j"  h(s)dy(s)  -  j  h2(s)ds]} 

o  Jo 


3ecause  of  its  importance,  we  single  out  the  exponential  term  in  (4.2! 
with  the  notation 


t  t 

Lt  5  exp[j  h(s)dy(s)  -  j  j^h2(s)dsj  . 


dp 


It  is  well  known,  (Wong  [45]),  that  L-  =  ;jp— ,  Lf  is  a  G^-mart ingale  on 


( <7 , F , P q )  and 


dlt  =  h(t)Ltdy(t),  LQ  =  1 


(4.3) 


(4.3)  is  the  crucial  relation  for  what  follows. 


In  order  to  state  the  main  theorem,  it  is  convenient  to 
introduce  the  functions 

in  (t,s1  ,• • •  ,sfl)  =  £{f(t)h(s1)--*h(sn);  n  >  0 


kn(s]  ,•••  ,sn)  =  E{h(s1 ) — h(sn) > 


(4.4) 


n  >  i 


Mote  that  in  (4. a)  the  expectation  operates  on  raneem  variables  wrier 
depend  only  on  the  :<(•)  process,  whose  law  is  invariant  unde'*  t.ne 
change  From  P  to  ?n.  Hence,  we  can  also  write 


•n 


t  <■  -  ...  <■  >  =  r  :  st  , 

-,J1’  ,n'  "O' 


^  n  .  S  ,  ,  ,  ^  ^  .  h  ,  i  .  n  ,  3  ; 

Hi  n  u  i  .n 


'i  = 


-  71  - 


Another  useful  process  'Is 

t  Sr 

L^>  =  [  ...f  h 


•••[  h(s-, )  •  •  *h(s  )L  dy(s  - )  — dy(  s,  5 

J  1  r+lsr+l  ‘ 

0  0 


The  existence  of  a  multiple  stochastic  integral  with 
random  coefficients,  will  be  justified  shortly.  We  now  state 
multiple  integral  expansions  for  f„. 


Iheorem  4.1 


(i)  If  EC!  h2(c)da]r  <  -  and  E[f2(t)  ( j  h2(5)dc)  r]  <  -, 

‘0  0 

V*> +  l  4r,««>  *  V,<t>Ltr>  Fv 

}  =  _ QfJ _  _ 

i  «•  i  i‘r)<kn>  *  E0a‘r>:^> 

n- 1 
T 

(ii)  If  E[exp  ^  h“(s)ds]  <  ®  and  E [f 2 ( t } ex p  '  hc's!os] 


V0  +  J1Itn>un) 


♦  i 


ind  the  infinite  series  in  (4.7)  converge  in  the 


%  _ tsm+ 


KO 
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Remarks  1.  We  call  (-1.6)  the  partial  filter  expansion  and  (1.7; 
the  full  expansion. 

2.  The  expansions  are  essentially  series  evaluations  of  the 
Bayes  formula  (4.2).  They  work  by  "separating"  the  x(-)  and  y(-) 
dependence  in  (4.2)  by  exoanding  L^;  y(-)  appears  only  when  integrated 
in  multiple  integrals  and  x(-)  is  integrated  out  in  the  kernels.  The 
kernels,  therefore,  require  only  knowledge  of  the  apriori  distribution 
of  x(-)  and  can  be  computed  offline  prior  to  filtering. 


Theorem  4.1  has  the  obvious  generalization  to  vector  valued  processes. 


The  proof  of  theorem  4.1  requires  that  we  handle  integrals 
r  r ' 

of  the  form  L*  ‘  or 

U 


r- 1 


j  •••[  h(s1 ) — h(sf)dy(sr) — dy( s^ ) 
0  'o 


(4.8) 


whose  kernels  are  random,  net  deterministic  as  in  chapter  3.  These 
are  easily  defined  cy  iteration.  For  this,  it  is  convenient  to  work 
with  the  measure  ?q,  with  respect  to  which  (y(t),G«J  is  a  Brownian 
process,  so  that  we  may  apply  the  standard  theory  of  stocnastic 
integration,  (see,  e.g.,  liptser  and  Shiryayev  [23]).  Thus,  if 

5 ( t )  is  a  measurable,  ^..-adapted  process  satisfvinc 

,rT  ,  ,  r  '  ' 

P„l  }"s  s )ds  <  d  =  : ,  we  have  an  :ntecral  :is)dv(s  w.ch  3 
u  -'3  ‘  _  rt  _  '  i  0 

version  soon  t.nat  suo  ji  s)dv(s)  <  =  1  iL’lctser, 

J  '  ' 

-J  V  W  "  *  J  *  -) 

Shir/ave-/  "23".  “he  hvcotnesi s  of  ' 4 .  l '  .  *  h“' :( c?  <  *  ;  s  thus 
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enough  to  guarantee  that  (4.3)  is  well  defined  for  all  oroers  r. 

f '• 

Indeed,  -i(t)  =  !  h(s)dy(s)  is  certainly  well  defined,  and,  moreover, 
J0 


Pncf  h2(s)-^2(s)ds 


(sup  !-j»(s)  | ) 

[0.T3 


T 

2  f  w2 


h  (s)ds  <  ®]  =  1 


Hence  t  t/l 

|  h(s])^(s])dy(s])  =  •  I  h(s1)h(s2)dv(s2)dy(s1) 
0  0  0 


is  well  defined,  and  we  can  continue  in  this  manner  to  all  orders. 
Similarly,  by  choosing  a  continuous  version  of  L*,  we  can  show 


Ll  = 


h(s,) 


•b(s  )L 
*+l  r+ 


dy(s. 


•dy(s1 ) 


is  well-defined. 


We  shall  also  encounter  expressions  c*  the  form 
t 


2g{;  > ( s ) dy ( s ) | F; }  , 


0 


and  for  these,  the  following  "stochastic  7ubini"  theorem  is  useful 


Lemma  4,1  Let  (b(t ;,Ft)  be  a  standaro  Wiener  o^cces; 
.%  =  t { b ( s )  s < t }  (’completeo  by-1^*1  sets) .  I*  :  s  i; 

u  —  *** 

m  * '  9 

= ..-adapted  orccess  suer,  tnat  E_  s  :s_ 


]  =  (■ 


x 
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E[  ;  ?(s)db(,s)  ’ 
*0 


=  eL?(s) !F°]cb(s) 
>0  s 


Proof  Liptser  and  Shiryayev  [28]. 


Proof  of  Theorem  4 . 1  Parts  (i)  and  (ii)  are  both  consequences  of  a 
multiple  integral  expansion  for  l  .  Indeed,  (4.3)  implies  chat 

t 

(4.9) 


Lt  =  1 


f 


h(s)L  dy(s)  . 

Jo  S 


Iterating  (4.9) , 


t  S1 


Lt  =  1 


+  f  h(s)dy(s)  +  i  :  h(s,)h(s-)L.  dy(s9)dy(s, ) 


I  ■  -  \  #  -w  \  -  /  ;  :  "  \  J|  /  "IJO/L  \  -  j  t 

Jq  J  QJq  I  *  S2  C 


L+.  = 


1  +  ]  h(s)dy(s)  - 


^0 


r 


’r-1 


JohUl) 


•h( 


3r)dy(sr) 


• • dy ( s ■ 


( r ' 
+  I'  ' 

t 


(4.10 


New  substitute  this  last  expression  into 


The  denominator,  for  example,  oecomes 


’n-l 


En(L  :F{}  =  1  x  >  En{|  •••;  h(s, )---h(s  )dy(s n)  •  • -dy(s, ) ' Fy> 
o  t  u  n=l  u  -0  Jo  1  n  n  1  L 

+  f  ( i  ^  ^ ) 1  py ' 

+  E0(Lt  i '  t '  • 


(4.11) 


f  2  r 

The  hypothesis  E[  h  (s)ds]  <  »  of  part  (i)  allows  lemma  4.1  to  oe 
J0 

applied  to  the  terms  of  (4.11),  with  the  result, 
r  Sn_1 

E0CL.iF^!  =  1  +  7  I  ..•[  EQ{h(s1 ) — h(s„) >dy(s„) — dy(s. ) 

u  u  L  n=l  JQ  h  J  1 


n 


F  a 'Fy>  =  1+7  In(k  )  -  -  :< 

L0'Lt  'V  tv<n'  '  "0'“t  "f 

n=  l 

A  similar  calculation  applies  to  Eq C^v  t ) L^. ;  ~'{) ,  thereby  completing  the 
derivation  of  the  partial  expansion. 


Formally,  the  proof  of  the  full  expansion  follows  by 
setting  r  =  »  in  (4.10).  To  prove  it  rigorously,  we  first  shew 

r  I  2 

that  E  exp[;  fr(s)ds]  <  =°  implies 
J0 


L*  =  m.s. (pj  1 i m [ 1  -  7 

'  M _ 


N  ft  r  n-1 

•••■  h(s, ) • •  -h(s  )dy(s  )---dv( 
n  =  1  C  *0  n  n 


\ 

4)  , 


( 4.i2; 


lenote  the  *inite  series  on  the  richt  ha nc  sice  o*  4.12'.  ov  '-!■  “hen 


rm.  .  ‘  trts-m*  + 
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S,, 


E0(Lt'At)2  =  E0[;  ' 
^  0 


h(s,) 


'h(s'W)LVidy,S:w;'"dy!Sl)r 


By  employing  the  standard  computational  rules  of  (2.1 
integrals ,  this  last  expression  equals 

-L  S, 


■or  stocnastic 


|0"'f0E0[h2(Sl)---|l2(s.N)LsJdS^l"'dS 


N 


N+l 


provided  that  it  is  finite.  However, 


S0[nZ(s,)---h2(S  )l(  ] 

0  1  N  SM+1 


SN+1 


=  EQ{h2(s,)---h2(sJ  exp[- I  h2(s)dsj 
u  1  '  0 


SN+1 


E0Lexp[2j  h ( s ) dy ( s ) 1 ! r M } 
L  J0  S1 


(4.13) 


With  resoect  to  ?Q,x(  * )  end  y(  * )  are  indeoende.nt  and  j[  •  '  is  Brownian. 


,SN+1 


Hence,  given  -,x(s) ,  s  <_  s-j},  1  h(s^dy(s)  is  a  Gaussian  random  /ariatle 


with  mean  0  and  variance 

SN+1 


,*  N+l  ? 

;  h^(s)ds.  Thus 


S 


N+l 


En[exp  2  1  h ( s ) dy ( s ) •  F*  ]  =  exo  2  h’(s;ds  . 
J  •  G  S1  -  C 


id  lai 


Therefore,  apolying  (4,14}  to  ( 4 .  ’  3 ; 
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Sv 


M+l 


(4.13)  =  E0ih~(s1 ) 


2  r  2 

exp [ :  h-(s)ds]; 


E0:h2(s,)- 


,  2,..  s  - 

>n  1 3M+1  '  - 


-  r  N+l  f 1  .-w  j-1 


Jq  Jo  Jo 


n2( 


and  hence 


5N+1 


;n--fnE0&2(5l)-h2(s,+,)i2w:«N+,...s 


0  -0 


’j-l 


j=N+l ' 0  -0 


•  *  • :  Er[h'(s1 )  •  •  -  h2Ui  •  •  -  d3- 


i 

=  l  i-j-En{!  •••;  h’(s-| )  •  • -n“(s  .)ds  .  •  • -ds- ■  . 

i=N+i  0  -'o  JQ  J  J  > 


j=N+l 
,T 


Since  E  exp[!  h“,'s)ds]  <  ®,  (4.15)  tends  to  0  as  N  -  »,  anc  thus 

J0  M 

iL  =  ,ti. s  (?n)  lim  A.  for  all  t  <  '.  Lemma  4.1  can  new  oe  invcnec 


for  every  order  n ,  so  that 


En-iL*  \rZi  =  En{rns.  lim  A'* '  f'{  } 


'0  t 


t  <- 


=  ,71.  s.  lim  En-..nt  r  _  ; 

M  O  >- 


=  m.  s.  (?,!  1  *mi  i  - 


n=  , 
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A  similar  proof  expands  Eg{f(t)L*  F i 


in  one  series 


io(t)  *  nI3'Sun)  • 


Finally,  to  derive  the  L1  (P)  convergence,  note  that 


i  nus 


2  2  ,T 

vfp1  *  ;glt  *  Etw  •  "2(»)*3  <  *  • 


F  i 


eo^fP-(Wi;(:V>: 


<  pl/2(dP_}"  e1/2[c  [l  .  , 
-  -0  ldPQj  L“0LLt'rt- 


-  -n  \ -2 
’  f  lt  (V'’j  • 


Thus  because  of  (4.12),  En[Lf:F^]  =  CL '  (?)  )  lim  [1  e-  T_  I*(kn)j 


-o-  t- 

as  claimed.  This  completes  the  proof  of  theorem  1.1. 


n=l 


Let  P(A,t|F{)  =  E Cl . (x(t) ) j f{]  denote  the  conditional 

Z  LA  '* 

distribution  of  x(t)  given  the  observation  up  to  time  t. 

T 

r  2 

Corollarv  If  E[exp  !  rr(s)ds]  <  » 

J0 


p(a, t : f^ )  = 


El  .  (x(t) )  -  l  I"(E1  Jx(t))h(Sl )  -  - -h(s  }) 
a  ^  t  a  1  n 


1  +  I  I?  (Eh(s1)---h(s  )) 
^  i  n 

n=  i 


A  relates  "formula  is  also  of  interes' 


t  „  \ 

X  - 


:e^$:  :y 


,  x(t)  ras  a  concTzicra!  aensi  -y  giver,  d;/ 


v  En(L(t)!F^x{t)=x]q(x,t) 

P(x,t) ;,>)  -  - - - — 7 - 

1  E0CL(t)!Ff] 


Using  the  same  techniques  as  above,  we  can  easily  derive 


EgCUt)  j r^,x(t)=x]q(x,t) 


»  n  +  i  i?(E[*(s,) 


n=0  " 


•h^sn) ;x( t)=x])j 


q(x ,t) 


(4.16) 


for  the  numerator  of  p(x,t!r^).  (4.16)  is  often  called  the 

unnormal ized  conditional  density. 


Theorem  4.1  immediately  suggests  a  scheme  for  approximating 

filters,  namely,  truncation  of  the  numerator  and  denominator  series 

after  a  finite  number  of  terms.  The  kernels  of  these  terms  are 

evaluated  off-line,  and,  if  necessary,  approximated  by  separable 

versions.  Construction  of  the  multiple  integrals  as  outputs  of 

stochastic  differential  equations  in  the  manner  of  theorem  2.3,  then 

provides  a  finite-dimensional  recursive  realization  of  the 

approximate  filter.  ^.Error  analysis  of  this  method  is  difficult,  even 

f  2 

in  the  case  E[exp  h  (s)ds]  <  »;  because  truncation  occurs  in  both 


numerator  ano  denominator  of  a  ratio  and  because  the 


oerms  ar 


hard  to  bound.  One  might  also  approximate  the  urnormal i zed  conditions' 
P  ( x ,  t :  F  )  by  a  finite  series,  but  a  nago'-  orewcac;<  eo  this  sc.oeme 
is  t.nat  one  cannoo  guarantee  that  ey<,t  rp)  >  3  “or  a‘l  x. 


and  0  <  n  <  r) 
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(it)  a(t)  5  Y  is  called  the  best  r’n  order  filter  of  f  { t } ,  given 
xy(s) :s<t},  if  E[f(t)-a(t)]2  ^  E[f(t)-b(t):2  for  t  <  T,  £or  all 
b(t)  e  Yr>  The  kernels  a.(t)  of  a(t)  are  then  called  the 
optimal  kernels. 

Existence  of  a  best  r^  order  filter  is  not  immediately 
guaranteed,  a ( t )  amounts  to  the  projection  of  f(t)  uocn 
Yr(t)  *  (a(t)  ja(  •)*'{  ':,  ana,  for  this  to  always  exist,  Y„(t)  must 
be  closed  under  mean-square  (P)  limits.  An  easy  sufficient  condition 
i  s 

f  7  r 

Lemma  4.2  If  E(  h“(s)ds)  <  then  Y„(t)  is  mean-souare  (?)  closed 
J  0 

for  t  <_  7. 

Proof  Apply  lemma  2.3  to  observe  that,  under  the  hypothesis, 

for  n  <_  r,  k  c  t^( [0tT]n) . 

To  find  the  best  r  '  order  filter,  one  must  compute 
the  optimal  kernels.  Accordingly,  in  theorem  A. 2  we  show  how  to  use  the 
multipi  icafon  formula  and  the  filter  expansions  to  derive  ''ntecral 
equations  for  the  a„(t),  0  <_  n  _<  r.  "his  reouires  two  ore  I  ini  nary 
lemmas,  ere  to  restate  the  oroblem,  toe  ot.rer,  to  verify  a  -teem:  :a  ’ 
identity,  we  assume  tnroucncut  trie  notations  estaol  *1  snec  ■>  ::  1  . 
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2 ,  _y  ,,2  ...  .  .2 


Lemma  4.3  Let  z,v  t  L“(.T,  F; ,?) .  Then  E(z~f(z})  <_  E(v-f(  t)} 

if  and  only  if  E(z-f(t))2  <  E(v-f(t))2. 


-  2  y 

Proof  f(t)  equals  the  projection  of  f(t)  onto  L  (f.,F^,P).  Since 

z-f(t)e  L^.F^.P),  the  projection  theorem  implies 


E(f(t)-f(t))  (z-f(t) )  =  a  . 


Thus 


E(z-f(t))2  =  E(z-f ( t) )2  +  E(f(t)-f(t))2 


2E(f(  t)-f(t) }  (z-f(t) ) 


=  E(z-f(t))2  +  E(f(t)-f(t) )~ 


Simi larly , 


E(v-f (t) )2  =  E(v-f(t))2  +  E(f(t)-f(t) )2  . 


Lemma  4.4  Let  c(t)  =  cn(t)  +  7  I?(c  (t)'  e  Y.(t)  ana  assume  thai 

u  n=l  1  n  r 

T  T 

E( ^  h2(s)ds)r  <  *>,  E  f 2 ( t ) ( ;  h2(s)ds)r  <  =  .  Then 

Jo  Jq 


{c(t)E0TL^.r)  !  F^> >  =  0 


(4.17) 


!  r  i  v 

:0  {c(t>  £Q{f ,tjL:  -  ;•  ;•  =  o 


a  IS 


Proof  From  (4.11) 


p 

c  { !  ( r )  =  E  rL  '  Fy  1  -  1  -  7  infk  ) 

>rtJ  O  '-  t  ■  tJ  “  1 1  n  ’ 


n=l 


and,  therefore, 


Eo{c(t)E0a[r)|Fy» 


“0C  ( '  Eo  [*- 1 1 


Enc(t)[i  +  i  i"(kn): 

0  n=l  L  n 


Since  y(-)  is  Brownian  under  Pr 


r  e 

„  I 


“n-l 


E0:=U)[1  *  y  -rJ(k„)];  -  c0(t)  ♦  I 
--  n= 1 J 


n= 


0  0 


■j  Cn't,Sl  *' 


’V 


E[h(s1)*--h(sn)]dsn-*-ds1 


However, 


.0{c(t)E0[Lt!  Fp}  .  E,t(t)  &  • 


_dP 


:C(  t; 


r  1  rn~} 

'  c°(£) 


E[h(s-j )  *  *  *h(sn)  jdsn*  •  •ds] 


by  lemma  2.5.  Applying  (4.1Sb)  and  (4.20)  in  (4.19a)  yie 


.  _  ..  _v 


-Q"'-  "~0  ~t 


■  _  ;  ;  ~  j 
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(4.18)  is  established  in  analogous  fashion  using  a  version  of  lemma 

2.3  for  expressions  f(t)  I?(c  ),  n  <_  r,  under  the  condition 
.  >  t  n 

E { f 2 ( t ) ( ]  h2(s)ds)r>  < 

J0 

T 

T  ?  2  r 

Suppose  now  that  E(j  h  (s)ds)  and  a(’)  e  Yr-  By 

expanding  E  [L.  jF^]  as  in  theorem  4.1,  we  obtain 
n  t  t 


'('>  E0[Lt|Ft]  ‘  [aQ(t)  *  j, 


2  r  4 


Cl  +  I  ii  (k.)  +  En{L^2r)  !F>;>] 

j=1  t  j  0  t 


(4.21 ) 


Using  the  multiplication  formula,  we  can  then  calculate  kernels 
gj  (t,s1  •  *  ,sj )  such  that 

3  r* 

(4.21)  =  gQ(t)  +  ^  I'j(gj-(t,s1  ,•  •  •  ,Sj ) )  +  a(t)  EQ{L^2r'  F^} .  (4.21a) 

Indeed,  for  G  <  j  ^  3r,  direct  calculation  with  (2.14)  gives 


•,  i..  eg, 

0  (m,n,i )sC ■ 


(4.22) 


C  •  =  {m ,n , i  m^n-2:= j , i <mi n\m,n) ,m<r ,n<2r } 


'hecrem  4.2  Assume 


o ,  ,  ,?>" 

n  '  s ,  ds  „  '  =»  and 


,  ^ 


CM 
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Thus,  we  will  demonstrate  (4.24).  Begin  by  noting  chat 


r  r 0  1  r-'J  1  _  (  -  rdP  t  -1  x  -  1  /  -  - .  r'J  ~  \  ‘ 

CLdP  :'tJ  ■  u0LdP0;rtJ^  "  'v “0 L L t 1 1  tJ ; 


E[c(t)-a(t)][a(t)-f(t)] 


_  ( c  ( t )  -  a  ( t ) )  [ a  ( t )  E Q  { L  t I  F* }  -  E Q  { f  ( t )  L  y  ?'{ } , 


■0LLt  rtJ 


E{E[%'Ft](c(t)'a(t))[a(t)E0[Lt'^:  -  :Fp:> 


=  E0C(c(t)-a(t))(a(t)£0[Lt!F{j  -  EQCf ( t J Lt ■ ) }  .  (4.25) 


Now  use  theorem  4.1  (i)  and  (4.21a)  to  evaluate  the  term 


a ( t ) E 0 [ L t ! r^]  -  £Q[f(t}Lt  r{] 


2r  i 

gn(t)-Ef(t)  +  t  I*.g^(t)-t  •  (t;2 

u  i=i  u  j 


3r 

I  i;[g,(t); 

j=2r+l  *  J 


+  ,  -  r-  (2r)  -V-. 

+  av^;t:rL-. 

■J 


mis  imo.ies,  since  j\  • ;  is  Brownian  .  t . 


:ntecrs  s  are  crorccor.a .  w.r. 
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4.25)  =  [Cg ( * )~  3g 1 1 ) ] ( 9g ( t )  ~lg [t]  .  ~ 


L 

j=l 


t  sj-! 

j'o-Jo  Ccj(t’si’-"’sJ) 


**  a.u>s1  ,••  •  ,s,. 


[g  .(t,s,  ,•  •  •  ,s  .)  -  i  -(t,s,  ,  • ■ •  ,s,)] 

J  1  J  J  ‘  J 

ds,  • • -ds. 


+  Eg{(c(t)-a(t))a(t)EQ[L^_r'  Fp} 


-  Eg{(c(t)-a(t))E0[f(t)L <2r)jr£!>  >  (4.26' 


The  last  two  terms  of  (4.26)  are  zero  by  lemma  4.3.  Thus,  it  is 
clear  that  (-.26)  is  zero  iff 

V  :J  • 

This  completes  the  proa*. 


The  technique  of  theorem  4.2  extents  to  ether  problems  a: 
well.  Suppose,  for  instance,  that  a  filter 

q 

jiftl  -  ,'/«•>  i.  J  r  d  !  3 '  /  4.  '  N 
3  V  t  )  ~  4g  i  U  /  u  *  t  ^  "  I"  *  '  ' 

U  _  ]  W  J 

J  I 


of  order  q  is  available;  a'(t)  need  net  oe  the  test 

£i 1  ter .  Let  -  >  q,  and,  rather  than  as:<  *or  the  test 

- .  - .  -  *  „  , ,  t  h  .  , 

r i  i  ter ,  e 1  j 3  5  9  9  <  t n 6  j6S*  ^  c  r  e  9  r  3  c  ^  v“  9  c  r " *■  r 

*.9.,  the  -ean-scuare  rrir'-*  z^nc  13  : -  tee  *r:^ 


th 


:  r  e  9  r 


Y  —  ‘ 
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a(:)  =  ajj(t)  +  l^a^vtJ 


q  .• 
j  =  l 


-  7  7  -  a  -  v 

x  \  3  4  \  "  ,  / 


j=q+l  w 


where  a.(t),  j  =  q  +  !,•••, r  are  free  to  be  chosen.  Define  the 

J 

kernels  g.(t)  as  before: 


3r  , 


a(t)EQ{L*  j  r^}  =  gQ(t)  +  ^Mj(g.(t))  +  a(t)cQ-;L^ 


(2 r)  ■ -y. 


Theorem  4.3  Let  the  hypotheses  of  theorem  4.2  hold.  Then  a(t)  is 
th 

the  best  r  ‘  order  correction  to  a'(t)  if  and  only  if 

g.j(t,s,  ,•  •  •  ,3  .)  =ECf  ( t )  h  ( St )  ■  •  -h(sj  q  ~  1  <  j  <  r .  (4 
j  i  j  i  J  —  — 


Proof  As  before,  it  suffices  to  shew  that  {4.27}  holds  i 


E[c(t)-a(t)][a( t)-f(t)]  «  0 


for  all  c(t)  =  a  * { t )  +  7  li(c.(t}).  3y  the  same  calculations  as 

j-q+1  L  J 

theorem  4.2 


E[c(t)-a(t) j[a(t)-f(t)] 


:0:Cc(t)-a(t)j[a(t)E0{Lt!Fp  -  EQ{f{t)Lt 

-  i  2r  i 

:  {  >  lJ(c.-a.)[g  (t)-tG(t)  -  :  IJ.[g,(t)-,., 

J  j=Q+l  '*  J  ■<  J  J  j  =  l  ^  - 
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"his  equals  zero  iff  g.  = 

9 


tor  q  -  1  _<  j  <_ 


Remark  Clearly,  an  analogous  result  holds  for  the  case  in  which  an 
arbitrary  subset  of  {a.}^  n  is  given  and  the  remainder  are  chosen 
as  to  optimize  the  mean-square  filter  error.  Thus,  if 
a^,  j  £  {jj ,  •  •  •  ,jq}  £.{0,1 ,  •  •  •  ,r>  are- given,  then  the  {a,.(t)}, 
j  t  { j  -j  » *  -  *  >  j  >  are  cptimal-ly  chosen  iff  g,  =  i.  for  every 
i  ?  i  n  i  . . .  — ■ r i  • • •  i  ' 

<J  ~  I  )  1 1  J  ,  \  J  1  >  » J  _  -  • 


As  a  first  example  of  theorem  A. 2,  let  us  compute  the 

kernel  equations  for  tne  best  linear  estimate  f(t}=  a„(t)  ^ 

*  u 

L 

f  a, (t,s)dy(s) .  From  (4.22) , 


gQ(t)  =  aQ( t)  +  '  ai(t,a)£[h(c) jds 


5-j  ( t ,  s )  =  a,  (t,s)  -  ;  a.  ( t  ,u )  E[h(  s )  n  ( n )  I  dr  -  a^tl.Eh  s. 

*  0 


"he  kernel  equations  are  then 


aQ(t)  -  (  a, (t,j'Eh(r}dr  =  £f(t) 
u  '  Q  1 


a0( t)Eh(s)  -  a . ( t , s )  +  ^  a, (t,r)E[h(s)h(r)]dc 
J  -'O1 

eliminating  an{t)  *y'om  che  second  equation. 


=  E*(  t) h(  s ,  or, 
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3  n  V  ^  )  ’’’  3-|(t,j)w  h(j;_dc  -  1  T’i  -  / 

J  '0 


i'i.22' 


a  i  ( t ,  s )  +  [  a,  (t,-)cov[h(s)  ,h.(r)3d-  =  cov[f(t)  ,h(s)j 
'  Jq  ' 


(4.28)  is,  of  course,  the  well-known  Wiener-Hoof  type  equation  fo r 


otoirnal  linear  filtering.  Sefore  examining  higher  -rder  examples, 


we  will  discuss  the  Kalman  filter. 


4.3  The  Kalman  -ilter 


Consider  again  the  signal-observation  system  (3.1)  in 


which  the  state  x(t)  is  a  Gauss-Markcv  process  solving  a  linear 


stochastic  differential  equation  driven  by  Srownian  motion,  and 


h(t,x)  =  H(t)x(t).  The  Kalman-3ucy  theorem,  summarized  in  lemma  2.1, 
shows  that  the  state  estimator  x(t)  =  E{x(t)  '{}  satisfies  the 


equation 


dx ( t )  =  F(t)x(t)dt  -  ?(t)H : (t) [dy(t;  -  H(t)x(t)ct; 


x(0)  =  x. 


:  X  n  c  ^ 


where  ?(t)  is  a  deterministic  function,  (see  chapter  3  for  der i ? 


Let  $(t,s)  be  the  state  transition  matrix  of  F(t)  -  P ( t ; H ' (t)H(t) . 


fhen,  the  solution  o*  (4.2S)  is 


",  '/  ^  ^  /  X  Q 


s ,  d  y 


i . e . ,  t.ne  octimal  estimate  is  1'oear  -'n  y. 


£ 
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is  of  interest  to  connect  this  reso't  to  the  excans' :n 


formulae  of  theorem  4.1  In  the  case  or  a  scaiar  signal, 


satisfying  (3.1'  and  h(t,x)  =  x,(4.7)  yields,  at  leas 


t  r  o  rma  :  !  y , 


Ex( t)  -  }  l}(ELx(t)x(s,)”-x(sJ]) 

•  u  *  w  ^  0 


x(t)  = 


(4.30! 


1  +  T  I;  ( E  ( x  ( s , }  •  •  •  x ( s  - )  5 

J  *"  ’ 


and  both  numerator  and  denominator  a^e  truly  infinite  sums,  .hns 


general  i^epre 


sentation  obscures  the  linear  structure  o~  xvt, 


techniques  for  apolying  the  expansion  formulae,  snculc  at  .east  me 


methods  for  deriving  the  linearity  o-  x 


- -cm  .4.30;.  m  “act, 


theorem  4.2  can  be  parlayed  into  a  preo^  o*  me  Kaiman-ruoy 
and  we  present  this  here  after  a  *"ew 


One  common  proof  of  the  !<a  iman-Eucy  t.neorem  invekei 
stochastic  differential  equations  for  the  conditional  moments. 
x(t)  =  b  ( t )  and  h(t,x}  =  x,  where  o-.  t,  is  a  sca.ar  sr  own.  an 
these  are 


At) 


-  n(n-1 )  ',n-2 


.  H  I  ' 

~  t  <  (  t ..  '  “  X  I,  t ,  X 


[dy(t)  -  x( 


i  r  :  n  •  •.  m  :  e  s  <■ 


~  j  -  s  ^  < '  ,  •'  2  '  d  n  o‘j  t , 


' Q :  e  r  -  -  3 


X  AO-A096  973  MASSACHUSETTS  INST  OF  TECH  CAMBRIDGE  LAB  FOR  INFORMA— ETC  F/G  12/1 
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DEC  80  D  L  OCONE  AFOSR-77-3281 

UNCLASSIFIED  LIDS-TH-1058  AFOSR-TR-81-0232  NL 
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not  yield  themselves  to  a  direct  solution.  Rather,  they  require 
additional  information,  namely,  that  (x(t),y(t))  is  jointly  Gaussian 
and  that,  hence,  by  limiting  arguments,  the  conditional  distribution 
of  x(t)  given  {y ( s )  [ 0  <_  s  <_  t>  is  normal.  One  can  then  conclude 
that  the  conditional  variance  E(x-x)^[Fp  is  a  deterministic  and  so 
effectively  truncate  the  system  (4.31)  at  n  =  2.  (4.29)  follows 

easily  [see  Kail ianpur-Striebel  [23]].  By  contrast,  the  derivation 
of  x(t)  from  (4.30)  does  not  involve  knowing  the  form  of  the 
conditional  distribution,  an  object,  that,  in  the  general  filtering 
problem,  is  not  often  in  hand. 


Let  us  develop  our  proof  of  the  Kalman  filter  for  the 

simple  case 


dx(t)  =  db(t)  x(0)  «  0 
dy(t)  «  x(t)dt  +  dw(t)  y(0)  =  0 


(4.32) 


in  which  b(-)  is  a  Brownian  motion.  We  do  this  in  the  interest  of 
computational  simplicity;  the  proof  carries  over  to  the  general  case. 


ft 


Theorem  4.4  x(t)  = 
Wiener-Hopf  equation 
(s'o  =  min(s,a)) . 


a(t,s)dy(s|  where  a(t,s)  satisfies  the 
a(t,s)  +  a(t,o) (s'cr)dc  =  s,  t  >  s, 

Jo 


(4.33) 


Proof  Since  y(-)  is  Gaussian,  the  set  of  polynomials  in  y(-)  is 
dense  in  L^(r»,F^,P) ,  a  fact  presented  in  the  discussion  of  hcmo- 


1 1  iiMUltf 
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geneous  chaos  theory  in  chapter  1.  Therefore,  it  suffices  to  show 
f  th 

that  a(t,s)dy(s)  is  the  best  r  order  estimate  for  every  r, 
JO 

1  ^  r  <"«.  It  is  true  for  every  r,  T  <  *  and  i  <  T  that 
~t  t 


EC 


b2(s)ds]r  <  -  E  b2(t)[ 


b^(s)ds]r  <  ».  Theorem  4.2  thus  applies. 


That  is, 
if 


a(t,s)dy(s)  is  the  best  r^  order  estimate  if  and  only 


9j(t*sl**‘'»sj)  =  Efb(t)bCs1)*'*b(sj-  )>»  j=0,l ,  • '  •  ,r  . 

From  (4.22) 

90(t)  =  0 

Sj(t»si » —  »sj)  s  ,)0(t)kJ._1)(s1,*--,sJ.) 

+  (a(t,  •)0(t)(kj+1)1)(s1,--*,sJ.)  j  >  0. 
Now 

j ( a ( t , • )  G  (t)kj._1)(s1,---,sj.)  =  ij-  a(t,sff^j) 

J 

Ei><s,<Z)>'"b(s,(j)> 

j 

*  a(t»si)E{b(s1 ) — b(s1_1 )b(si+1 ) — b(s^)>  , 
and 

(a ( t . • ) i  0(t)(kj+1)1(s1 ,  —  , s j ) 
t 

-  I  a(t,a)E{b(-)b(s1 )  *  *  * b{ s  .)  >dcr  . 

J  0  1  J 


ft 


MMiaHli 
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The  kernel  equations  become 
0  =  Eb(t) 

t 

* 

a(t,s)  +  a(t,a)E{b(a)b(s) }da  =  E[b(t)b(s)] 

J0 

ia(t,si)E(b(s1)...b(si.1)b(s.+1)...b(sn) 
t 

+  a(tt<j)E{b(cr)b(s-i)"*b(s.)}da 
JQ  1  J 

=  E[b(t)b(s<j)*  •  ’b(Sj)]  (4.36) j 

2  <  j  <  r 

(4.34)  is  true  by  definition,  and  (4.35)  is  just  (4.33).  We  now 
claim  that  if  a(t,s)  satisfies  (4.35),  (4.36)j  is  true  for  all 

j  2.  This  will  imply  that  the  equations  for  1  a(t,s)dy(s)  being 

th  '’° 

the  best  r  order  estimate  are  satisfied  for  every  r,  and  will 

complete  the  proof.  To  do  this,  assume  a(t,s)  solves  4.35,  and 

observe  the  identity 

E{b(a)b(s-| )  •  •  - b ( s j ) }  =  J  ( . )E[b(s1 )  •  •  •  b( si _1  }b(si+1 )  •  *  *b(Sj)] 

M  (4.37) 

(4.37)  results  from  a  direct  application  of  lemma  3.3. 

Now  substitute  (4.37)  into  the  left-hand  side  of  (4.36). 
and  use  the  Wiener-Hoof  equation  for  a(t,s) : 


(4.34) 

(4.35) 
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£  a(t»si)E[b(s1)*--fa(si_1)b(s1+1) — fa(s^)] 


JO 

j  t 


a(t,ff)E[b(c)b(s1) •••b(sj)]do 


*  l  {a(t,s .)  +  a(t,c)Co-sJdff}  E[b(s, )  •  • -b(s,  ,)b(si 

1=1  1  Jq  1  1  1,11 


35  .1  (*1*0  E[b(si)---b(s._i)b(si+1)---b(sj)l 


a  E[b(t)b(s-| )  *  *  *b(Sj)]  . 

The  last  equality  employs  lemma  3.3  again  and  validates  (4 . 36) j  for 
any  j. 


4.4  Quadratic  filters 

In  this  section  we  treat  best  second  order,  or  quadratic, 

filters  as  an  example  of  the  theory  of  4.2.  We  first  present  the 

optimal  kernel  equations  for  this  case  and  then  show  how  they  may  be 

solved.  To  guarantee  validity  of  the  discussion,  we  assume  throughout 

the  hypotheses  of  theorem  4.2  for  r  =  2: 

T 

E( !  h2(s)ds)4  <  - 
JO 

T 

Ef2(t) ( [  h2(s)ds)4  <  »  vt  <  7  . 

J0 


Deriving  the  integral  equations  is  simply  a  matter  oi 


calculation.  Let  a(t)  =  an(t)  +  a,(t,s)dy(s)  + 

r*  U  A  * 


I  j  a2(t,s«|  ^Jdy^Jdyts-j )  be  a  quadratic  estimate  and  let 

Vo 

9 -(t,s,, 0  <  j  <_  2,  be  the  kernels  associated  with  a(t) 

J  *  J 

in  the  manner  of  (4.21a).  Thus,  using  (4.22) 


90(t)  35  aQ(t)  +  [a1  (t. •  )3i  G  (t)[k1]1  +  [a2( t , * )  12 0( t )  [k2]  2 

9-|  (t,s)  =  a-j  (t,s)  +  aQ(t)k1(s)  +  [a1(t,*)31  O  (t)Ck2]1(s)  (4.38) 

+  [a2(t,*)]1  ©(tJEk^^s)  +  [a2(t,-)l2  0(t)[k3]2(s) 

g2(t,s^  ,s2)  ~  a2(t,s-j  ,s2)  +  ag(t)k2(s^,s2)  +  fa  j  ( t ,  • )  J  0(  t)f<-j3Cs-j  ?  s2 ) 


+  [a1(t)]1  O  (t)[k3]1  ( s-,  ,s2)  +  2[a2(t)]1  ■ 

0(t)  [k2]1(s],s2) 

+  [a2(t)]20  (t)[k4]2(Sl,s2). 

By  theorem  4.2,  a(t)  is  optimal  quadratic  if  and  only  if 
Ef(t)  =  gQ(t) 


Ef(t)h(s)  =  g^t.s) 

Ef (t)h(s1 )h(s2)  s  g2(t,s1,s2) 


(4.39) 


Now  evaluate  g.,  0  <_  j  <_  2  in  (4.39)  using  (a. 38)  ana  the 

J 

definitions  of  0  (t)  and  k..  The  result  is  in  its  *'u;l  blown 

J 
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ugliness, 


Ef(t)  ■  aQ(t)  +  J  a1(t,s)Eh(s)ds  +  j  j  a2(t,Sj ,s2)Eh(Sj )h(s2)ds2ds1 

0  0  0  t  a  /in 


Z 

Ef(t)h(s)  »  a-|(t,s)  +  aQ(t)Eh{s)  +  a-j (t,a)Eh(c)h(s)dc 


(4.40a) 


+  a2(t,s,a)Eh(a)da  +  |  j  a^t,^  ,c2)Eh(c^ )h(a2)h(s)ds 
°  00  (4.40b) 

Ef(t)h(s1)h(s2)  =  a2(t,s1,s2)  +  ac(t)Eh(s1 )h(s2)  +  a1 (t,s1 )Eh(s2) 


+  a1(t,s2)Eh(s1 )  +  a1  (t,j?)Eh(c)h(s1  )h(s2)dc 


+  [a2(t,sT  ,a)Eh(a)h(s2)  +  a2(t,s^)Eh(r)h(s.)]dc 


«■»  WJ 

+  f  f  a,(t,ff,  ,j-)E{h(s,  )h(sjh(r,  )h(o_)}  dc7dc,  (4.40c) 
Jq  jq  4  14  l  i  1  l  L  \ 


These  equations  deserve  some  elementary  remarks  before  we  set 

about  solving  them.  First,  the  optimal  kernels  are  all  interrelated 

in  the  general  case.  We  cannot  solve  for  aQ  and  a^  independently 

of  knowing  a2  .  Likewise,  if  ag  =  Cg  ,  a^  =  c^  are  the  kernels 

of  the  best  linear  estimate,  they  will  not,  in  general  be  the  lower 
order  kernels  of  the  best  quadratic  estimate.  Secondly,  the 
equations  (4.40)  can  be  used  for  other  suboptima  1  designs  in  the 
soirit  of  tneorem  4,3.  Thus,  if  ag  and  a.  are  given,  ana  we 


Ik 
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ft 

seek  the  best  quadratic  correction  to  an(t)  +  :  a,(t,s)dy(s)  , 

u  •'  o 

this  will  be  found  by  solving  (4.40c)  for  a2  in  terns  of  a^  and 

ag  .  The  methods  developed  for  solving  the  full  set  of  equations 

(4.40)  will  also  apply  to  this  problem. 

To  solve  (4.40),  we  first  eliminate  ag  and  a^  to  derive 

an  integral  equation  solely  for  a2  .  3g  is  easy  to  handle.  Merely 

solve  (4.40a)  for  ag(t)  in  terms  of  a^  ,  a2  ,  and  the  known 

functions  Ef (t)  ,  Ef(t)h(s)  ,  etc.  and  substitute  this  expression 

In  (4,40b)  and  (4.40c).  To  further  simplify,  use  (4,40b)  in 

a1  (t.s-j ) Eh (s2 )  +  a1  (t,s2)Eh(s1 )  of  (4.40c).  We  thus  derive 

t  to, 

f  r  1 

aQ(t)  *  Ef (t)  -  j  a1 (t,o)Eh(a)da  -  j  a2(t,a1 ,a2)EH(a1 )h(o2)da2da1 

(4.41a) 


a-|  (t,s)  =  cov[f(t)  ,h(s)]  -  j  cov[h(s)  ,h(o)]a1  (t,c)da 


Eh(a)a,(t,s ,a)do 

'l  t 


t  a, 
r  r  1 


-j  j  cov[h(s),h(a1 )h(a2)]a2(t,o1 ,o2)dc2dc1  (4.41b) 


a2(t,s1,s2)  =  cov[f(t) ,h(s1 ) ,h(s2)]  -  J^cov[h(s1 ) ,h(s2) ,h(c)]a1 (t,c)dc 


-  j  [cov[h(s1 ) ,h(o)]a2(t,s2c)  + cov[h(s2),h(o)]a2(  t.s,  c)dc 


n  -J1 


cov[h(s1 ),h(s2),h(r1 )h(o2)ja9(t,r, ,z2)cc7iz. 


(4.41c) 
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(In  these  expressions,  cov[x-j  =  E(x^-Ex^ }  •  •  •  (xr~Exr) . ) 

We  have  yet  to  eliminate  a^  from  (4.41b)  and  (4,41c),  but 
this  requires  some  more  notation  and  a  bit  of  theory,  Define  the 
operator  R^t):  L2([0,t])  -  L2([0,t])  with  kernel 

r.j(s>a)  =  cov[h(s)  ,h(a)j  by 


(R1(t)g)(s)  = 


t 

cov£h(s) ,h(a)]g(a)da 

0 


R^(t)  appears  in  (4.41b)  and  (4.41c),  In  particular,  (4,41b)  may  be 
rewritten  as 


[I+R-j  ( t ) ]( a i  (t, ) ) (s)  =  cov[f(t),h(s)]  -  j^Eh(a)a2(t,s,a)dc 

1  ral 


cov[hu)  ,h(a-j  )h(a2)]a2(t,a1  ,a2)dc2dcr1 


(4.42) 


and  thus,  solving  for  a^  in  terms  of  a2  requires  inverting 
l+R-|(t).  Fortunately,  this  can  always  be  done  in  an  explicit  way. 

t 


Lemma  4.5 


i)  h(t)  has  a  best  linear  estimate  h(t)  =  a«(t)  +  i  cu  (t,s )dy(s ) 

u  Jo 

t  <_  T  .  Without  loss  of  generality,  we  take  c^(t,s)  =  0  for 
T  >  s  >  t  >  0  . 


ii)  I+R-j  (t)  is  invertible,  and  [I+Rj  (t)]"''  =  l-Q(t)  where  Q(t) 
is  the  integral  operator  with  kernel 


q(t,Sj  ,s2)  _  3-j(s-|,s2)  +  ctj  (s2  ,$i ) 

,t 

“  I  3,  (u  jS,  )ci,(«,Spj  dc»  0  <  s,  ,s,  <  t 

J  0  1  '  1  6  “  1  4  - 


i 
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rT2  4 

Proof  E['  h  (s)ds]  <  »  guarantees  that  h(t)  exists  (lemma  4.2} 
JQ 

and,  as  in  (4,28) 


C  1 

ai  Cs-j  ,s2)  +  (s-j  ,a)  cov[h(s2) ,h(a)]dc 


=  cov[h(s1 ) ,h(s2)]  0<  s2  <  1  <  T 


ii)  follows  from  results  of  Kailath  and  Geesey  (Geesey  [16], 
Chapter  3).  These  imply  that,  under  the  hypothesis 

[  cov[h(s) ,h(s)]ds  <  *  ,  which  is  certainly  implied  by 


,  I+R^(t)  is  invertible  and  its  inverse  has  the 


E[f  h2(s)ds]4  < 

Jo 

given  form. 


We  now  apply  this  lemma  to  solve  (4.42)  for  a,(t,s)  , 


a,(t,s)  =  cov[f(t) , h ( s ) ]  -  l  q(t,s,o)  cov  [f(t) ,h(o)lda 

1  Jo 


r  '(t,s,a,  ,a?)a2(t, c1  ,c2) dc2dc  1 


(4.43) 


where 


r*  (t,s,o1  ,a2)  -j  cov[h(s)  ,h(o1  )h(a2)] 

+  ^  [q(t,s,a2)Eh(a1 )  -  qft.s,^  )Eh( o2)] 


+  2  j  q(t,s,a)  cov  [h(cr)  ,h(o1  )h(o2)]dc  . 

In  deriving  r‘  ,  we  took  advantage  of  the  symmetry  of  a7  to 
symmetrize  r'  .  Finally,  we  substitute  (4.43)  into  (4.-lc)  to  get 


% 


-  1 Q1 


^2 Ct , s -j  >$2) 


F(.tjs1,s2) 

ft 
0 

t  ft 


j  [r]  (s1  ta)a2(t,s2,a)  +  r-j  )^pC'->2-j  ,j)ldc 

h2^t,s-|  JS2 ,a-|  (4,44) 


where 


F(t,s1  ,s2)  *  cov[f(t),h(s-|),h(s2)] 

rt 

-  j  cov[h(s.j ) ,h(s2) ,h(c)](cov[f(t) ,h(o)] 

ft 

-  j  q(t,a.|S2)  cov  (f(t)  ,h(32)]da2)da 

r2(t,s-j  ,s2 ,^1  ,j2)  s  [cov[h(s-| )  ,h(s2)  »h("i )  ,h( j2)] 

-  cov[h(s1 ) ,h(s2)]  cov  [h(c1 ) ,h(c2)]] 
ft 

-  j^cov(h(s1 )  ,h(s2)  ,h(->)  j  r  (t,-.,r5  ,c2'dr  . 

'fie  have  shown  that  if  (ag,a.j,a2)  solve  (4. 40)  then  a2  solves 
(4.44).  Conversely,  by  reversing  the  steps  of  this  derivation,  if 
a2  solves  (4,44)  and  ag  and  a^  are  defined  via  (4.43)  and 
(4.41a),  then  ag,  a^ ,  a2  solve  (4.40). 


(4,44)  is  simoly  a  linear  integral  for  ,  However,  the 
middle  term  of  (4.44),  involving  a  tensor  contraction  between  a 
and  r^  ,  is  non-standard,  and  the  usual  linear  integral  equation 
techniques  do  not  directly  apoly.  In  what  follows,  we  will  show  now 


(  J 
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to  eliminate  the  tensor  contraction  term  to  derive  a  Fredholm  integral 
equation  for  a7  ,  thus  reducing  the  kernel  equations  to  a  well- 
known  problem  for  which  methods  of  solution  or  approximate  solution 
are  readily  available. 


It  is  useful  to  do  this  in  a  general  context. 

Definition  4.2.  Let  3(s^  .s^s  L2  ([0» t]2)  and  let 
2  4 

y(s-i  ,$2^1  ,c,)e  ([0  ,t]  )  such  that  y  is  symmetric  in  s^s,  .  The 

operator  r :  L2([0,t]2)  -  L2([01 t j2 ) 


(•t  r t 

(Tc) (s1 s2)  =  [3]1©(t)[c]1 (s1 ,s2)  +  j  |^y(s1 ,s2,a1 ,c2)c(o1 ,o2)dc2da1 

is  said  to  be  of  tensor  contraction  (T-C)  type.  The  kernel  3  can 

o 

alto  be  used  to  define  an  integral  operator  on  L  ( [0 , t] }  ,  which 
we  will  denote  by  3  ,  and,  in  fact,  we  can  write 


ft 


[p]f?(t  )[c]i  {s] ,s2)  =  |  [3(s1 ,a)c(s?,j)  +  3( s 


f)c(s. 


»-)]d c 


JO 


=  (Bc(s2,*))(s1)  +  (3c(s1  ,-))(s2; 


Remark:  It  is  of  interest  to  note  that,  while  the  second  term  of 
r  is  the  usual  compact,  Hilbert-Schmidt  operation,  the  tensor 
contraction  term  is  not  compact  in  general.  Since  compact  operators 


have  finite  dimensional  eigenspaces,  we  can  prove  this  by  sucolyinc 

1 


a  3  such  that  iceL_([0,  tj2) ! c  =  £1G(t)c1;  is  infinite 


dimensional . 


is  easily  seen  that 


JO. 
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3Cs1  ,s2)  =  ^3-1(s1)-iCs2)  , 

where  the  i.  ,  1  <_  i  <  M  <  =°  are  mutually  orthcgonal ,  will  worx. 

Then  if  are  functions  orthogonal  to  Span-;:^ , •  • , ;  , 

M 

c(s1  ,S2)  [?i  (s-|  )Oi  (s2)  +  ( s2)^i ( si ) ^  satisfies  c  =  3-j  •  ( t)c^ . 

The  space  of  such  solutions  is  clearly  infinite  dimensional. 


For  a  T-C  operator  f  characterized  by  kernels  S  and 


we  want  to  solve  the  integral  equation 


c  =  F  +  Tc 


,  FeL2([0,t]2) 


(4.45) 


which  generalizes  (4.44). 

Theorem  4.5.  Suppose  that  1-3  is  invertible  and 


([I-B]_1o)(s)  =  o ( s )  +  [V  (s,o)q(o)dc 


3]  e L2([0,t]2) 


rhen,  if  c  =  Tc  has  no  non-zero  solution,  I-f  has  a  bouncea 


inverse. 


Proof  Using  (1-3)  we  will  derive  a  Fredholm  equation  for  c 
from  (4.45).  Thus,  suppose  c  =  F  +  fc.  From  the  definition  of 
we  have 

rt 

[(I-B)c(si , *)](s2)  »  c(s1 ,s2)  -  I  S(s2,5)c(s, ,o)dc 


=  PCs, ,s7)  +  1  3(s, ,c)c(s9,r)a- 
1  1  -O' 


:Q  '0 


r  m  _  \  r  _  __  \ 

v.  s,  ,^2’“!  ’:2:C[  ~V2-' 
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Sy  inverting  I-B  and  rearranging  terms,  we  derive 


ft 

C(I-B)c(*  ,s2)3 (s-j )  =  F(s1  ,s2)  +  ' (s2 ,a) FCs-j  ,o)dc 

ft  r t 


,Vsl's2'al  )c(cr1  ,a2)dc2da1  (4.46) 


where 


Y*l  (s-j  i$2 ’^2 )  =  y(s*j  * ^2 ’^1  ®  C$2 ) 3 Cs-j 

rt  i 

j  3  (s2,n)  C s -j  >r  ,cj2 )dn 


Now  invert  (4.46),  to  get 


rt  rt 

c(S]  ,s2)  -  Fi  (Si  ,s2)  j  j  ( S-|  ,$2 , , o2 ) c (c-|  ,c2 ^ do^^o'-j  (4.4/; 


where 


Fi  (Si  ,s2)  ~  F(s^,s2)  +  J  3  ( s2  ,o )  F  ( s-j  ,o )  do 


+  f  31  (s1  ,o1  )[F(aj,s2)  +  |  31  (s2,o2)F(o1  ,o2)dc2]do1 


and 


ft 


1 


Y 3 ( 5 1 ,s2 ,0i ,o2)  =  y i (Si ,s2,Oi ,o2)  +  j  3  (Sl ,n)Y1 ( n  >  s  2 ,o i »-2)dr 

(4.47)  is  the  desired  Fredholm  equation  for  c  ;  if  c  solves 
c  =  F  +  Fc  then  c  satisfies  (4.47).  Conversely,  if  c  satisfies 
(4.47)  then  c  =  F  +  fc  .  Analogous  reasoning  shows  that  c  =  7c  iff 


:(Si,s,)  =  (r,c)(s,,s9)  = 


f- 


'V  '“3‘’n:,l,32/  "  j  jqv3^s1  ,32’”l  >r2'dc2dci 


(i.48) 
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Thus,  if  c  s  7c  has  no  nonzero  solution,  neither  does  c  =  T^c  . 

But  7^  is  a  compact  operator,  and  hence  the  Fredholm  alternative 
theorem  implies  that  I is  invertible.  This  clearly  implies  that 
I-r  is  invertible  also. 

The  statement  of  theorem  4.5  may  seem  odd  because  it  does  not 
focus  on  the  central  equation  (4.47).  This  is  done  to  emphasize 
that  once  the  Fredholm  equation  is  derived,  we  need  conditions  to 
guarantee  it  can  be  solved.  Stipulating  that  1  not  be  an  eigenvalue 
of  T  provides  just  such  a  criterion.  Further,  the  statement  of 
theorem  4.5  may  be  extended  to  a  sort  of  mutant  Fredholm  alternative 
for  tenser  contraction  operators. 

Corollary.  If  I-AB  is  invertible  and  \  is  not  an  eigenvalue  of 
7  ,  then  I-A7  is  invertible. 

Proof.  Completely  analogous  to  that  of  theorem  4.5. 

The  equation  (4.44)  for  a2(t,s.j,s2)  is  of  tenser  contraction 
type;  in  fact,  we  may  write  it 

a2(t,SpS2)  =  F(t,s1,s2)  +  [R(t)a2(t,*)Ks]  ,s2)  (4.49) 

where  R(t)  is  the  tenser  contraction  operator  characterized  by 
the  kernels  3(o^ ,a^)  =  -r(a^,o2)  and 
y(s1 ,s2,o1 ,a2)  =  -r2(t,s1 ,s2,o1 ,o2)  , 

Theorem  4,6.  I-R(t)  is  invertible  and  a2(t,*,*)  is  the  unicue 


solution  of 
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ft  ft 

^2^’si  »s2^  ”  (^>^1  >s2'  "*"  I  j  ,rCt,s1  > s 2 * ^ l  *-2Ja2vt,~1  »~2^c2<"c1 

(4.51) 

where 


^-j  (t»S-j  >S2)  ~  F(t.S-j  ^2)  ~  Cq ( t » S2 1  J-j ) r  ( t , Si  » o' 2 ( t  , S-j o ^ ) r ( t »®i  » S2 )  1 

dc^da-j 


^  j  Q  ^ ^  ^ )  Q  ( ^  >  ^  2  >  ®  2  ^ r  ( ^  ’  ^  ]  *  ^  2  ^  2 dcr , 


y(t,s^  *s2»ai  >a2^  — ^'](^,^'],^21<^1’i^2^  ”  j  d(^»s>u)Y^(u,5p,j^,w2)du 


r-j  ( t , s i  >s2>0']  >cr2^  =  ”  r2 ^ ^ ^  1  ,s2’c1  ,c2^  ~  L,^1  ,sr2^ 

t 

+  q(t,S2»u)r2(t,s1  ,u  .c^o^du  . 

0 

Proof  From  theorem  4.5,  it  suffices  to  show  that  I  +  R-j(t)  is 
invertible  and  that  1  is  not  an  eigenvalue  of  R(t).  The  invertibility 
of  I  +  R^(t)  is  proven  in  lemma  4.5.  The  eigenvalue  condition 
is  a  consequence  of  the  uniqueness  of  the  best  quadratic  estimate. 

For  suppose  that  c(s^,s2)  =  (R(t)c)(s^  ,s^) .  Then  a^t.s^.Sg)  = 
a2(t,s-j,S2)  +  cCs-j^)  would  also  be  a  solution  of  (4.44),  and  hence, 
if  2g(t)  and  aj(t,s)  were  defined  from  a^t.s-pSg)  via  (4.43)  ana 
(4.41a),  a^(t)  ,a^  (t,s)  ,a2(t,s-|  ^2)  would  also  satisfy  the  optimal 
kernel  equations.  This  contradicts  the  uniqueness  of  the  best 
quadratic  estimate.  The  definitions  of  and  v  follow  from 
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the  proof  of  theorem  4.5.  Finally,  note  that  if  y  denotes  tne 
integral  operator  defined  by  y,  I  +  7  is  invertible  and  hence  (4.51; 
has  a  unique  solution. 

We  have  thus  reduced  the  complicated  kernel  equations  (4.40) 
to  a  simple  Fredholm  equation  which  can  be  solved  by  standard  methods. 
Moreover,  we  can  achieve  a  similar  result  for  the  problem  of 
determining  the  best  quadratic  correction  to  a  linear  filter,  which, 
as  previously  mentioned,  requires  solving  (4.40c)  for  a2  -'n  terms 
of  and  a-j .  Again,  (4.40c)  is  an  equation  of  tensor-contraction 
type  for  a2  and  the  tensor  contraction  kernels  are  the  same  as  in 
(4.44): 

a2(t,s1,s2)  *  [Ef(t)h(s1)h(s2)  -  aQ(t)Eh(s1 )h(s2)  -  a] (t,s] )Eh(s2) 

ft 

-  a1(t,s2)Eh(s1 )  -  j  a1(t,ff)Eh(ff)h(s1)h(sz)de] 

’0 

t 

"  j  Cri  (s2>o)a2(t,s-j ,-)  r^  (s-jC7)a2(t,s2,c)]dc 

0 

t 

* 

E{h(s1  )h(s2)h(a-|  )h(a2) }  *  a2(t,c1  .j^dc^d^ . 

0  0 


Note  that  this  method  does  not  succeed  in  solving  the 
optimal  kernel  equations  recursively.  Rather,  t  is  fixed  through¬ 
out  and  the  relevant  operators  are  defined  and  inverted  on 
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process  would  be  repeated.  A  recursive  solution  would  use 
a^t.s^s^  to  construct  a2(t+dt,s.j ,  ) . 

An  important  problem  is  to  determine  conditions  on  f,  h 
and  the  signal  process  x(-)  such  that  a^(t,s)  and  a2(t,s<,s2) 
are  separable,  for  in  this  case  the  filter  can  be  constructed  with 
stochastic  differential  equations  (see  theorem  2.3'.  This  has  not 
yet  been  done  and  is  fairly  complicated  due  to  the  complex  manner 
in  which  Ef(t),  Ef(t)h(s),  etc.  combine  to  produce  the  kernels 
F-|  and  y  of  (4.51),  the  Fredholm  equation  for  a?. 


t 
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CHAPTER  5  NONLINEAR  FILTERING  PROBLEMS  WITH  FINITE  DIMENSIONAL 

ESTIMATION  ALGEBRAS 


Suppose  that  a  signal  x(t)  is  defined  by  the  stochastic 
differential  equation 


dx(t)  =  f(x(t))dt  +  g(x(t))db(t) 


x(0)  s  xr 


(5.1) 


and  that  it  is  observed  via 


dy(t)  a  h(x(t))dt  +  dw(t) 


(5.2) 


As  usual,  b(t)  and  w(t)  are  assumed  to  be  independent  Brownian 
motions.  The  filtering  problem  associated  to  (5.1)  -  (5.2)  will 
be  completely  solved  if  the  conditional  distribution,  P(x(t) cA ; F^) , 
A  t  {3orel  sets  of  state  space),  is  known.  A  basic  Question  is: 
when  can  P(x(t)eA;cp,  as  a  measure,  be  characterized  by  a  finite 
set  of  statistics  propagating  recursively  in  time? 


Recent  progress  on  this  issue  has  come  from  several 
directions.  First,  V.  Benes  [1  j  has  proved  the  following  result 
by  probabilistic  methods.  Suppose  g  =  constant,  h(x)  =  ax  +  2,  and 
f  is  a  global  solution,  (i.e.,  defined  on  all  of  ?'  of 

f’  A  f2  =  ax2  *  bx  *  c  .  (5.2' 


"hen  the  conditional  distribution  o*  <(t;  gi  /en  has  a  tensity 


which  can  be  expressed  in  terms 


3  r  *  r,  -  ’  e  m 3* 


3  .Z.  -  •  i  V-  -  J 
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generated  by  Kalman-Bucy  type  filtering  equations. 

There  is  also  a  suitable  generalization  to  the  case  of  vector 
signals  and  observations.  This  result  covers  the  case  treated  by  the 
.Kalman-Bucy  theorem,  (lemma  3.1),  but  it  gives  new  examples  of 
finite  dimensionally  computable  filters  as  well.  We  will  refer  to 
the  filtering  problems  treated  in  Senes'  theorem  as  the  "Sene's 
examples". 

Secondly  Brackett  and  Clark  [7  ],  Witter  [35],  and 
Brockett  [5  ,6  ]  have  suggested  that  Lie  algebraic  techniques 
can  be  applied  to  the  nonlinear  filtering  problem.  They  show  hew 
to  associate  a  Lie  algebra  of  operators,  the  so-called  estimation 
algebra,  with  filtering  models  such  as  (5.1)  and  (5.2)  and  how  the 
Lie  algebra  structure  bears  upon  the  filtering  solutions.  In 
particular,  they  suggest  that  when  the  estimation  algebra  is  finite 
dimensional  it  may  be  possible  to  compute  conditional  densities 
fini te-dimensionally . 

In  this  chapter,  we  will  pursue  the  imDlications  of  Lie 
algebraic  techniques  for  exact,  finite  dimensional  calculation  of 
conditional  densities.  The  first  section  will  sketch  the  basic 
ideas  of  this  theory,  especially  those  that  concern  generating 
filter  solutions  from  the  estimation  algebra  structure.  As  part 
of  this  exposition,  we  will  derive  oy  bie  algebraic  taerniques  the 
conditional  density  for  the  orcolem  of  estimating  a  3rownian 


-  m 


motion  in  white  noise.  The  solution  to  this  problem  is,  of  course, 
well-known.  We  derive  it  here  not  for  the  end  result  but  to  demon¬ 
strate  and  explore  a  Lie  algebraic  technique,  called  the  Wei -Noman 
method,  (Wei  and  Norman  [44]),  that  establishes  the  connection  between 
finite  dim.  estimation  algebras  and  finite  dim.  filters.  Further,  a 
rigorous  derivation  of  a  filtering  solution  directly  from  the  algebraic 
structure  has  not  appeared  in  the  literature  for  diffusion  signals 
and  so  we  present  one  here.  As  another  part  of  this  treatment,  we 
will  indicate  connections  between  the  Lie  algebra  strategy  and  the 
theory  of  Lie  algebra/Lie  group  representations  on  infinite  dimensional 
vector  spaces  (see  also  3rockett  [6]).  This  will  provide  us  insights 
into  the  behavior  of  the  filtering  problems  we  consider.  Also,  Lie 
algebraic  theories  of  estimation  are  presently  very  incomplete  and 
we  believe  representation  theory  will  ultimately  offer  much  to  their 
study.  Finally,  we  briefly  develop  the  Benes  examples  from  the  Lie 
algebraic  viewpoint,  a  possibility  first  realized  byMitter  ,  who 
suggested  it  to  us,  after  Benes'  results  became  known. 

The  remainder  of  the  chapter  is  devoted  to  the  search  for 
new  examples  that  may  be  solved  Lie  algebraical ly .  Section  2  presents 
a  case  to  which  the  methods  developed  in  this  section  do  not  apply, 
and  it  explains  why  not.  Section  3  contains  the  principal  results 
of  the  chapter.  It  first  gives  necessary  conditions  for  a  general 
class  of  vector  process  models  to  have  finite  dimensional  estimation 
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algebras,  since  these  are  the  models  that  could  possibly  be  solved 
by  the  algebraic  techniques.  By  applying  this  result,  we  are  able 
to  list  all  possible  problems  with  finite  dimensional  estimation 
algebras  for  scalar  process  models.  We  then  ask  the  question:  for 
which  of  these  examples  does  the  Wei -Norman,  Lie  algebra  calculation 
work?  Our  results  are  largely  negative.  Roughly  speaking,  they 
indicate  that  only  for  the  previously  known  examples  coes  the 
calculation  work. 

5.1  Estimation  Algebras  and  Filtering 


Our  concern  henceforth  shall  be  with  the  filtering  problem 
stated  in  (5.1)  -  (5.2).  Suppose  that  for  all  t,  x(t)  has  a  density 
q(x,t).  Then  the  conditional  distribution  of  x(t)  given  F^  has  a 


den si ty 

p(x,t!Fp 


E0':Lt ! Fr ’  x(t)sx^(x>t) 


r  <;'J 

-olLt 


ri; 


(5.3) 


This  is  easily  derived  from  the  ;<al  1  ianpur-Striehel  formula.  Call 
the  numerator  of  (5.3)  p(x,t’r^).  p(x.t'F^)  captures  the 

x-dependence  of  p(x,t'cp,  that  is,  it  equals  p(x,t,Fp  up  to 
a  random  normal ization  factor,  and  it  is,  therefore,  called  the 
unnormalized  conditional  density.  o(x,t;F^)  is  easier  to  work 
with  tnan  p(x,t'F^). 

1  2, 

Let  L*  *  j  g-(:<)  -  t—  f(x;  oe  the  for-varo  generator  of 

£  —  .‘A 

jX 

x(  t) .  'Jnder  aocrooriate  regular;ty  cordicirns,  lake' 


Pardoux  [40]) ,  the  unnonnal i zed  conditional  density  p(x,t'F^)  defined 
in  (5.3)  satisfies  the  stochastic  partial  differential  equation 

dp(x,t)  =  L*p(x,t)dt  +  h(x)p(x,t)dy  ^ 

p(x,0)  =  Pq(x)  =  initial  density  of  x(0). 

The  Stratonovich  and  white  noise  forms  of  (5.4  )  are,  respectively, 

dp ( x, t )  =  [l*-  -^(x)]p(x,t)dt  +  h(x)  p(x,t)dy  (5.5) 

and 

=  CL*-  •jjib^(x)  ]p(x,t)  +  h(x)p(x,t)y(t).  (5.6) 

In  (5.5),  cT  denotes  the  Stratonovich  differential.  (5.6)  is  a  formal 
expression  because  y ( t )  does  not  exist  except  in  a  generalized 
sense  but  it  is  useful  in  calculations.  (Mote:  As  in  (5.4)  - 
(.5.6),  the  yO)  dependence  of  p(x,tjFp  will  often  be 
suppressed  for  rotational  convenience.)  All  or  any  of  these  equations 
will  be  referred  to  as  Zakai's  equation,  (Zakai  [47]).  For  the  Lie 
algebraic  theory,  it  is  necessary  to  work  with  (5.6),  since 
manipulations  involving  y(t)  obey  ordinary,  rather  than  Ito 
calculus,  and  the  Lie  algebra  results  to  be  adapted  were  developed 
for  deterministic  problems  with  ordinary  calculus. 

The  precise  question  that  we  will  study  here  may  now  be 
stated.  When  can  p(x,t)  be  characterized  by  a  finite  number  of 
statistics  propagating  in  time;  in  other  words,  when  does  p(x,t) 
evolve  on  a  finite  dimensional  manifold?  The  new  approach  to 
filtering  that  we  deal  with  here  is  to  learn  about  p(x,t)  by 
applying  lie  algebra/uie  group  methods  to  Zakai's  eouaticn.  Let 
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Lq  =  L*  -  ^  h“(x)  and  let 

{LQ.h>LA  denote  the  lie  algebra  of  operations  generated  by 
h  using  the  bracket  operation  [A, 3]  =  AB-BA.  After  Brockett 
we  ca  11  CL0>h;LA  the  estimation  algebra. 


L-,  and 
0 

C5  3. 


The  fundamental  idea  is  that  {Lg,h}^A  carries  information 
about  the  infinitesimal  behavior  of  p(x,t).  In  particular,  if 
p(x,t)  evolves  on  a  finite  dimensional  manifold,  then  { Lq , h >,  A 
ought  to  be  finite  dimensional  as  a  consequence  of  the  interplay 
between  Lie  groups  and  Lie  algebras,  (Brockett  [5]).  Thus,  we  can 
search  for  finite-dimensionally  solvable  filtering  problems  by  looking 
for  examples  in  which  dim{Lg,h}^A  ^  =.  This  will  become  clearer 
in  the  subsequent  discussion. 


Example  1 :  (Brockett  and  Clark  [7],  Mitter  J25]). 

Consider  the  simplest  case  covered  by  the  Kalman-Bucy  theorem 
(see  1 emma  3.1). 

x(t)  =  xn  +  b(t) 

0  (5.8) 

dy(t)  =  x(t)dt  +  dw(t) 

where  Xg  is  a  random  variable  independent  of  the  orocesses  b(t) 
and  w(t).  For  (5.3),  Zakai's  equation  is 

^  2 

■§£  p(x,t)  =  (j  7^2  -  ^x2)p(x,t)  +  y(t)xp(x,t) 

(5.9) 

p(x,Q)  =  density  of  xQ  . 

The  correspcndi ng  estimation  algebra  is  easily  seen  to  be 
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A  = 


-  ,1  d‘  12  d  T  - 

Span  ^  ^  x  ,  x,  jyj  1  > 

c  dx"  d  a 


Define  A-  =  m- 


1  d 

2 


dx 


1  2 

'2  “  2  X  * 


The  commutation  relationsof 


are 


^  =  dx 

£A0’  *  x 
C^,X]  =  I  . 


(5.10) 


We  remark  that  A  is  solvable.* 


Estimation  Algebras  and  Solutions 

In  exploring  the  interaction  between  finite  dimensional 
estimation  algebras  and  finite  dimensionally  computable  p(x,t)  we 
must  first  confront  the  question:  Given  a  finite-dim.  estimation 
algebra  how  does  one  integrate  it  to  get  a  solution  of  (5.6),  i.e., 
how  does  one  determine  p(x,t)  from  the  algebra  structure?  This 
problem  relates  naturally  to  the  theory  of  integrating  representations 
of  Lie  algebras  of  unbounded  operators  on  a  Hilbert  space  to  a 
corresponding  representation  of  a  Lie  group.  We  present  seme 

★ 

A  lie  algebra  G  is  solvable  if  the  series  of  ideals  G-.  =  [G,G],---, 

Gn  =  [Gn_-j  ,Gn_i  ] ,  •  •  ‘  terminates  ac  0  for  some  n  <  =. 

I  CG  is  an  ideal  if  [1,5]  I. 


i 
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ideas  of  this  theory  to  motivate  the  main  calculations!  method, 
and  because,  as  mentioned  above,  we  find  it  a  useful  source  of 
concepts  for  thinking  about  estimation  algebras. 

Let  G  be  a  finite-dim.  lie  algebra  and  G  its  associated 
(simply  connected)  group.  Let  H  be  a  complex  Hilbert  space. 


Definition  5.1 

A  representation  I  of  G  on  H  is  a  map  T  from  G 

onto  a  set  of  linear  operators  on  H  with  a  common,  dense,  invariant 
domain  D  such  that  [T(x),T(y)]  =  T(!x,y])  for  all  x,y  -c  G.  Like¬ 
wise,  a  representation  -  of  G  on  H  is  a  map  t:G  -*»  L (H )  = 

bounded  linear  operators  on  H  such  that  t(g-j  g9)  -  t( g- ) t (g^)  for 

grg2  s  G. 


The  problem  of  integrating  a  Lie  algebra  representation 
to  a  group  representation  is  as  follows.  Given  a  representation 
of  G  on  H,  when  does  there  exist  a  group  representation  t  of  G 
such  that  t(etx)  =  etlvX^  Y  x  e  G~  Here  et,vX^  is  a  group 

A 

generated  by  T(x)  in  the  sense  that  e"  :  =  T(x)j  v  ?  sO. 
Suppose  that  x-j ,  ‘  * '  ,x^  is  a  basis  for  G  and  that  we  have  groups 
e<'T(xi ) , i=l , , , ,  ,d.  A  method  for  constructing  r  locally  is  to 


define 


"lxl . . 


■  ~r  x  .1 
d  v  c' 


Formal ly ,  this  can  be  made  to  work,  if  the  oceratcr  identitv 
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e 


X. 


1 


30 


v 


tx. 

1 


holds  for  X.  s  T(x.),  1  <  j,  j  <  d.  (FI  ato,  et  al .  £12  1). 

J 

(T)  will  be  of  chief  importance. 


The  procedure  recalls  the  Wei -Norman  [441  technique  for 
solving  differential  equations.  Let  us  develop  this  formally  for 
the  above  situation. 


eouation 


Suppose  that  'in  H  we  want  to  solve  the  evolution 


do 


^  =  X]P  +  u(t)X2?  (5.12) 

which  is  similar  to  Zakai’s  equation.  We  try  a  solution  in  t(G)'» 


g1(t)X1  g  (t)X, 
p(t)  =  e  1  1 • * *e  d  dpC0)  . 


For  thi s  p(  t) , 


(5.13) 


Ht  =  M-t)XlP 

g1(t)x]  g2(t)x2  gd(t)x. 


g2(t) 

gd(t) 


X2e . 1 

gjtjx,  gH(t)xd 

e1  1---Xded  dp(0) 


p(o)  +  -  ’  ’ 


(5.14) 


From  (4),  for  1  <  i,  j  <  d 


*[aaA[3 


r : 


n  1 


I  . -4 
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tX.  d  .  .  tX. 

e  JX.  =  )  c1  ,J(t)X  s  J 

m=1 

and  applying  this  repeatedly  in  (7), 

F1(g(t),g(t))X1p+-*-+Fd(g(t),g(t)).XdP  (5.15) 

for  some  non-linear  functions  F-  of  g(t)  *  (g-j(t)  ,*  *  •  ,gd(t))  and 
9(t).  For  p(t)  to  solve  (5),  F.(g(t) ,g(t) )  =  1, 

F2(g(t) ,g(t))  =  u(t)  and  F ^ ( g ( t ) ,  g(t))  =  0  for  j  >  2.  Solving 
this  set  of  equations  (locally  in  t)  for  g(t)  gives  a  local 
solution  of  (5) . 

We  will  use  this  method  to  solve  a  filtering  problem,  but 
first  we  present  a  few  more  remarks  on  Lie  algebra/Lie  group 
representation  theory.  The  heuristic  ideas  contained  in  (5.11)  and 
(I)  have  been  worked  into  a  rigorous  theory  by  Nelson  [37]  and 
Flato,  et  al.  p3]  for  the  situation  in  which  G  is  represented 
by  skew- symmetric  operators  and  7  is  required  to  be  unitary  (i.e., 
to  take  values  in  the  space  of  unitary  operators).  Their  results 
involve  heavy  use  of  the  notion  of  analytic  vectors  (Nelson  [37]); 
see  appendix  2  for  a  definition.  In  their  theory  7  will  exist 
if  the  algebra  domain  D  contains  a  dense,  invariant  set  of  vectors 
analytic  for  each  element  of  a  basis  for  7(G) .  Conversely,  if  7 
exists  there  is  a  common,  dense  set  of  analytic  vectors  for  the  whole 
lie  algebra  (see  Flato,  at  al.  [13]).  Further  implications  of  the 
theory  are  revealed  in  the  following  examples, which  play  a  role  in 


■  •  s* jbiu 
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the  later  discussion. 

Example  2  Let  =  {-  ix,  ~,-i>  and  specify  its  domain  as 

S  =  rapidly  decreasing  functions 

*  {(j»  e  c  (  1R)  j  sup  |x^3axp(x)(  <  “V  c^S  s  Z  } 
x 

Ai  is  then  a  representation  of  the  so-called  Heisenberg  algebra  on 

2  2 
L  (R)  and  it  does  generate  a  Lie  group  on  L  (!R).  It  is  not 

important  here  to  present  this  group.  However,  it  is  interesting 

to  construct  a  domain  of  analytic  vectors  for  a,  because  this 

1  j  2  1  2 

involves  the  second  order  operator  AQ  =  ;=r(~j^)  +  ?r{-ix)  = 

1  d^  1  2 

-5- — 7  -  tc  x  ,  which  arose  in  the  estimation  alaebra  A  of  example  I. 
i  dx^  d  2 

Indeed,  it  turns  out  that  AQ  on  Ld(IR)  has  a  discrete  spectrum 

{X  }“  1 ,  X  <0,  lim  X  =  Let  }"  ,  be  the  corresponding 
n  n- 1  n  _  n  n  n- 1 

n-*ui 

eigenvectors,  and  define 
N 

O'  =  {  |  ani>n|N  <  *} 

O'  is  a  dense  invariant  set  of  analytic  vectors  for  Aq  and  Ag. 

It  is  easily  seen  that  D‘  is  also  a  dense  set  of  analytic 
vectors  for  A  of  example  1. 

Example  3  Let  ^  =  {-ix,  -i}  be  a  representation  of  the 

O  +  y,  X  y)  X 

Heisenberg  algebra  on  L  (R  )  with  domain  Cq(R  )  =  {?eC  (R  }:?  has 

4. 

compact  scpoori  in  R'}.  In  phis  case,  a  unitary  representation  - 
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generated  by  A,  does  not  exist.  The  reason  is  that  i  —  will 

not  be  essentially  self-adjoint  on  Cq(IR')  (see  Appendix  2),  and, 

by  a  theorem  of  Nelson  [37],  this  precludes  t.  Nelson  [37]  also 
2  2 

shows  that  i  -  i  x^  =  i(4~)  +  y(-ix)^  will  not  be  essentially 

dX  .  gp  ^ 

self-adjoint  on  c”(!R  )  and  that  Cq(  1R  )  will  not  contain 
a  dense,  invariant  set  of  analytic  vectors  either.  We  shall  observe 
analogous  behavior  for  the  filtering  problem  studied  in  section  2. 


We  will  now  adapt  the  Wei -Norman  method  of  (5.12)  -  (5.15) 

to  the  solution  of  (5.S)  in  example  1.  The  first  step  is  to 

tx 

associate  evolutions  e  to  the  elements  X.  of  the  estimation 

algebra  A.  However,  for  A,  the  situation  is  considerably  more 

complicated  than  in  the  theory  of  representations  by  skew-symmetric 

operators.  It  will  no  longer  always  be  possible  to  generate 

groups  with  the  elements  of  A,  or  to  insure  that  the  evolutions  are 

bounded.  Nevertheless,  we  proceed  with  the  most  natural  definitions. 
tAQ 

Define  (e  | t>0 }  to  be  the  semigroup  associated  to  A^  by 

solving  in  L  (]R)  the  equation 

u(t)  =  AQu(t)  u ( 0 )  *  $  s  L2GR)  . 


It  is  well-known  that 
tA„ 


(e  o)(x)  =  jG(x,y,t)?(y)dy  t  _>  0 


5.16} 


_  T  /?  1  ?  ? 

G(x,y,t)  =  (2-sinht)  ~  exp[-  j  (cotht)(x  +y  )  +  xy/sinnt[ 


1 
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Similary,  define 

(etX^)(x)  =  etx?(x)  *  ttS 

ti- 

A  v 

(e  $)  (x)  =  $(x+t)  t  e  3R  . 


As  in  (5.13)  let  us  try  to  solve  (5.9)  by  the  expression 


sd 


p(x,t)  =  [e 


9l(t)AQ  92(^)x  93^/' ax  34,(0 

a  0  0  n  _  l 


'5. 17) 


where  the  g.(t)  functions  are  to  be  detemined.  One  may  certainly 
1  92(t)x 

raise  objections  to  (5.17)  —  the  operator  e  is  unbounded, 

g^(t)  cannot  take  negative  values  --  but  these  will  be  cleared  up 

as  we  go  along.  The  Wei -Norman  method  will  allow  the  values  of 

g.(t)  to  be  calculated  if  (I)  holds  for  the  elements  of  A. 

This  requires,  for  example,  that 


e  °x*  =  (cosht)x  e  +  (sinht)^-  e 


(5.18) 


e  0  ^  ?  =  (sinht)x  e  °j  +  (cosht)^  e 


'  c 


5.19) 


and 


„tx  d  .  _  d  tx  .  tx 
e  It  ?  ’  d7  e  *  "  1  e  * 


(5.20) 


The  right-hand-sides  of  (5.13)  and  (5.19)  are  derived  from 
(I)  by  using  tne  identities 

r  .  ,  - 2n 

[ao  A-j  x  =  x 


his  5 0  •  v e 3  ij v  t ,  ~  xu',  t)  u;0  -  =  2 ,  b u t  “  in  l  ,T. ) . 
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r,  .  .  -i2n+l  d 

[ad  A0]  x  dx 

which  follow  easily  from  the  commutation  relations  (5.10). 

Lemma  5.1 

(i)  (5.20)  holds  for  every  $  e  C^(]R) 

(ii)  Let  V  =  (d(x)  =  x3ea  X’d(x)  |3  .csdR.-j;  e  L^(R)}.  For  every  3  £  7 

tA0 

(e  4>)(x)  =  G(x,y,t)o(y)dy 
]R 

exists  and  is  infinitely  differentiable  on  {(x,t)jt>0;.  Further 


„m+n  tAn  ,  .m+n 

-Vn  ^  °*)(x>  =  f-Vn 

3tm3x  }  3t  3X 


G(x,y,t)sp(y)dy 


(5.21) 


3  tAn  tAn 

and  for  t  >  0,  ^  (e  U*)(x)  =  (AQe  ?)(x) 


(5.22) 


(iii)  (5.18)  is  true  for  every  <3  c  V,  (5.19)  for  every  3  s  V 
such  that  i '  £  V. 

1  92(t)*  33(t)4r  g4(t) 

Remark  If  Pg  £  L  (JR)  then  e  c  eJ  xe  p^  c  7 

and  hence  p(x,t),  as  given  by  the  product  of  evolutions  (5.17),  is  a 

well-defined  function  in  J  (IR). 


Proof  (i)  is  an  elementary  calculation. 


(ii)  can  be  derived  easily  using  the  explicit  form  of  3(x,y,t;. 


The  principal  observation  to  make  in  coinc 

2 

for  t  >  0,  G(x,y,t)  will  decay  like  e"''/ 


as  y 
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(iii)  The  following  calculation  proves  (5.13) 


tAn 

(e  ux<i)(z)  =  ;G(z,y,t)y$(y)dy 


G(*.y.t)L_-,+(cotht)x  G(x,y,t)  !„„  ] 


x=  z 


x=z 


sinht  j>(y)dy 

,  tAn  tAn 

=  (sinht  —■  e  °*)(z)  +  (cosht)(xe  ?)(z) 

To  get,  (5.19)  integrate  by  parts 

(e  0  ~*)(z)  *  [s(zfy,t)^  ?(y)dy  *  -  !  [^  G(z,y,t)]*(y)dy 

tAn  i  tA- 

=  (cotht) (e  °xc p)(z)  -  |^t(xe  ?)(z) 

tAn  a  tA n 

=  sinht  (xe  Ud)(z)  +  cosht  e  ?)(z). 

To  obtain  the  last  equality,  we  used  (5.13). 


Let  p(x,t)  be  as  in  (5.17).  We  will  now  solve  for  the 

m  (5.17) 

g,(t)An  g,(t)x  g,(t)i-  cAt) 


functions  g^(t)  i  =1,2, 3, 4.  From  (5.17) 


Tz-  =  g,  ( t) A_p  -  g-(t)  e 


x  e 


e'J  ax  e 


g,(t)x^ 


+  g3(t)  e  1  0  e 

x  g4(t)p 


4 

cx 


'  _±_  r-  !  *  ' 

■'CX  =4'w/ 


CJ1 
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By  applying  (5.13)  -  (5.20),  one  derives 


3t  =  ^1 


F2(g(t),g(t))xp(x,t)  +  F3(g(t),g(t))^p(x,t) 


+  F4(g(t),g(t))p(x,t) 


(5.24) 


with  F i s  given  below  in  (5.25).  But  if  p(x,t)  is  to  solve  (5.9) 
with  p(x,0)  «  Pg(x)  we  must  require 


g-j  ( t )  =  1 

y(t)a  F^g(t),g(t))  =  g2(t)cosh  g1  (t )  +  g3(t)sinh  g-j(t)  (5.25) 

0  =  F3(g(t),g(t))  =  g2(t)sinh  g^t)  +  g3(t)cosh  g^t) 

0  =  F4(g(t) ,g(t))  =  g4(t)  -  g3(t)  g2(t) 
g.j(t)  -  0  i  3  1 *  ,4 

(5.25)  may  be  easily  solved.  The  result,  written  in  terms  of  the 

dy(t)  notation  rather  than  y(t),  is 

t 

g2(t)  =  J  cosh(s)  dy(s) 

0 


t 

g3(t)  =  -  |  sinh(s)  dy(s) 
0 


g4(t)  = 


t  t 

(sinhs) (coshs)ds  -  f  g2(s)(sinhs)dy(s) . 
0  0 


(5.25) 


Finally,  by  substituting  these  expressions 
explicit  form  of  the  kernel  G(x,y,t)  of  e 


(5.17)  and  using 
we  derive 


tne 


-  125  - 


P(x,t) 


30 

j  k(z,u} 

-CO 


(t) [x-m(t)] 


2 

P0(2)dz 


(5.27) 


p(t)  =  tanht 


t 

. 

0 


sinhs 

cosht 


dy(s)  . 


Let  p(x,t;z)  denote  the  integrand  of  (5.27)  exclusive  of  pQ(z). 
p(x,t;z)  may  be  interpreted  as  the  unnormalized  conditional  density 
of  x(t)  for  the  process  x(t)  starting  at  x(0)  =  z.  It  is  clear  that 
the  normalized  version  of  p(x,t;z)  is  a  Gaussian  density  with  mean 
m(t)  and  covariance  p(t),  and  this  agrees  with  the  Kalman-3ucy 
solution  of  the  problem.  Though  these  calculations  used  the 
y(t)  formalism,  they  can  be  carried  out,  with  some  added 
computational  complexity,  using  the  rigorous  Ito  calculus.  Therefore, 
we  have  redertved  the  Kalman  filter. 


Remark  In  presenting  the  Wei -Norman  technique,  it  was  indicated  that 

in  general  it  only  gives  solutions  local  in  t.  However,  theorem  5.1 

provides  a  solution  for  all  t  >_  0.  This  happens  because  the 

estimation  algebra  A  is  solvable  (see  example  1).  Wei  anc  .Norman 

9.-(t)X. 

show  that  for  solvable  matrix  Lie  algebras,  if  the  operators  e 

are  placed  in  the  correct  order  in  (5.13),  global  solutions  can  be 

found.  Without  further  elaboration,  we  observe  that  this  **esul: 

9n ( t ) A- 

extends  to  the  present  case  and  motivates  putting  e  first 

in  (5.17). 
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The  3ene?  Example 

The  tie  algebra  strategy  is  also  able  to  recover  the  theorem 
of  V.  Bene^  quoted  above.  Consider  the  scalar  case 


dx(t)  =  f(x(t))dt  +  db(t) 
dy(t)  =  x(t)dt  +  dw(t) 

f’  +  f2  =  ax2  +  bx  +  c  .  (5.29) 

The  Zakai  equation  is  then 


3p  _  ,1  3 
3t  "  '2 

a  a 


j  ~  f(x)  -  ^  x2}p(x,t)  y(t)xP  . 


:  5.30) 


'  i  '  2  I  ^ 

The  estimation  algebra  A.  -  ij  ~  f(x)  -  j  *  .*;LA  a9ain 

3X 


1 


finite  dimensional.  Indeed,  if  l0  =  j  -  tt  f(x)  -  7  xS 

C  3x 


3X 


1  2 
2 


A  =  Span-.L0»x,dx  -  *,Io 
and  its  commutation  relations  are 

^L0,x^  =  dx  -  r 

[Lq,^  -f]  =  (a+1  )x  +  |  (5.31) 


These  calculations  are  valid  without  restriction  on  the  coef*ici ents 
a,  b,  and  c.  However,  for  general  a,  b,  c,  (5.29;  may  net  have  a 
global  solution,  in  which  case  (5.31)  is  to  re  inte^cretec  on  -unctions 
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whose  domains  are  contained  in  the  region  where  r  is  defined.  We 
will  explore  when  (5.29)  has  global  solutions  in  section  5.3. 


We  could  now  try  to  solve  the  Zakai  equation  (5.30)  by 


p(x,t)  =  (e 


9,(t)L0 


92( t) x 


i^ax 


94C  t) 


P0)(x,t) 


However,  it  is  simpler  to  first  rewrite  Zakai 's  equation  via 
a  simple  transformation  that  compares  to  the  gauge  transformations 
of  quantum  physics.  Let  z  z  7.  and  define 
x 

F(x)  =  <  f(s)ds 


If  q(x,t)  *  e~F^x'p(x,t) ,  a  substitution  in  (5.30)  demonstrates 
that 


3q  _  rj_  3 _ 

3t  " 12 17 ' 


■jCCa+l  )x2+bx+c]]q 


y(t)xq 


/  c  32  > 


(5.32)  is  similar  to  the  Zakai  equation  for  the 

Brownian  signal  example.  Although  (5.32!  is  not  the  Zakai 

equation  for  a  filtering  problem,  let  us  define  its  Lie  algebra 

o 

operators  as  A  =  (L^x}^  L^  =  j  —  ^  -  j  [(a+1  )x~+bxfc]. 

3  X 

A  is  isomorphic  to  A  by  the  ismcrchism  L^  -  Ln,  x  -  x, 

U  J 

d  d 


o* 


dx 


dx* 


i . 


It  is  trivial  to  observe  that 


A  =  Soanvp 


(3-1 


3  ~  2  2 

;a 


X  , 


(j  X 


L 
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Suppose  that  f  is  globally  defined  so  that,  in  order  to  find 
p(x,t),  we  seek  a  solution  q(x,t)  of  (5.32)  that  is  defined  for  all 
x  e  E.  Suppose  further  that  (a+1)  >  0.  We  can  then  solve  (5.32) 
by  the  method 


q(x,t)  =  (e 


9-|(t)AQ  g2(t); 
e 


,(t) 


q0)U) 


The  g.(t)  functions  will  be  slightly  different  from  those  calculated 
for  example  1  because  of  the  different  commutation  relations  between 
Ao,x’ix  and  resultin9  solution  p(x,t)  =  er^q(x,t)  is 

precisely  that  obtained  by  Benes.  Whether  the  same  method  can  be 
made  to  work  for  the  case  in  which  f  is  not  globally  defined  is 
an  issue  we  will  take  up  in  the  remaining  sections. 


5.2  Estimation  of  Absorbed  Brownian  Motion 

This  section  counterpoints  the  successes  of  5.1  by  providing 
an  example  in  which  the  estimation  algebra  has  finite  dimension,  but 
in  which  the  W’ei-Norman  calculation  does  not  work,  'he  signal  in 
this  example  is  a  3rownian  motion  absorbed  at  a  boundary,  and  the 
associated  filtering  problem  is  very  similar,  operator-wise,  to  the 
problems  treated  in  5.1.  3ut  it  turns  out  that  the  Wei-Norman 
met.nod  cannot  proceed  because  t.ne  identity  (I)  fails  for  tn.e 
estimation  algebra  operators,  'he  pnenemcnology  of  this  failure  is 
discussed  in  ncoe  o*  characterizing  the  estimation  a'gebras  ‘‘or  wnicn 
tne  tecnnioue  dees  or  dees  rct  .vcrx.  ror  t.ne  examoles  at  ^anc. 
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crucial  information  is  revealed  by  the  algebra's  behavior  on  the 

1  2 

eigenvectors  of  the  partial  differential  operator  Lg  =  L*  -  j  h 
(see  5.6)  of  Zakai's  equation  and  by  the  interaction  between  operators 
and  boundary  conditions.  The  domain  structures  of  the  estimation 
algebras  are  also  more  closely  identified.  An  important  role  is 
played  by  the  existence  or  non-existence  of  a  dense  invariant  domain 
of  analytic  vectors  associated  with  the  eigenvalues  of  Lq. 


Let  b(t)  be  a  Brownian  motion,  let  Xg  be  a  r.v.  with 
density  Pg(x)  such  that  Pg(x)  =  0,  x  <_  0  and  such  that  Xg  is 
independent  of  b(t),  and  let  r  =  inf{t Jxg+b(t)  =  0}.  Consider  the 
problem 


X(t)  «  (Xg+b(t))  l{t<T} 

dy(t)  =  x(t)dt  +  dw(t) 


(5.33) 


x(t)  is  an  absorbed  Brownian  motion  with  random  initial  value.  The 
distribution  will  now  have  two  parts;  an  atom  Qg(t)  =  Pa(x(t)=0}, 
and  a  measure  Q(A,t)  =  Pa{x(t)cA-{01)  (for  AC  [0  ,»))  with  density 
q(x,t).  Accordingly,  the  unnormalized  conditional  density  of  x(t) 
will  have  two  parts: 


and 


Pa(t)  • 


Entl 


,LJ.F i 


0 1  (x(t)=0)  t !  t ' 


P(A,t) 


E0{1 


{x(t)£A-{orLt 


P(A,t)  will  have  a  density 
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P ( x » t )  =  q(x,t)  E0{l{t<x-kexp  [j  (xQ+b(s))dy(s) 

vo 


'If  (xQ+t>(s))2ds]  [F^,  x(t)  =  x} 


(5.34) 


Again  these  statements  are  all  consequences  of  the  Kal 1 ianpur-Strieoe 
formula.  Assuming  differentiability,  p(x,t)  will  satisfy  the 
Zakai  equation 


^  =  ~  \  x2)p(x»t)  +  y(t)xp(x,t) 

3  X 


'5.35) 


p(0,t)  =0  t  >  0 
p(x,0)  =  pQ(x) 


(5.35)  may  be  derived  formally  as  follows.  Let 


V  =  { f  c  C(IR  )  j f  has  finite  limit  at  + 


Let  (T(t)f) (x) 


be  the  semigroup  on  V 


generated  by  absorbed  3rownian  motion.  The  generator  of  T(t) 
2 

j  — ar,  with  domain  0  =  {f  e  C^Jf"  c  V,  f"(0)=G;  (see,  e.c., 
dx 

Lamperti  [JS]).  ror  :  :  0,  define 


£nuf'x(-))L 


e !  n  \  3  ! 

1 'u'? av  " 


r(x )  p  { x ,  t )  dx 
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3y  applying  the  stochastic  differential  equations  of  Altering 
(Fujisaki,  KaVhanpur,  ;<unita  [’s]  and  Kunita  [25]'  one  may  derive 


da4.(f)  =  ct(i  — j 
t  fc  2  dx2 


f)dt  +  G*(xf  )dy.'t) 


(5.371 


for 


3dz 


f  e  D. 
d2  f) 


By  substituting  (5.36)  in  (5.37}  and  integrating 
by  parts,  one  finds 


f(0)dPa(t)  -  |  f(x) 

(0,®) 


dp(x,t)dx 


»  [-  \  f’(0)p(0,t)  -  \  7(0)  p'x.t  •  ■ x_ty 


+  ff(x)  X  — J  p(x,t)dt  dx 

(o,-)  3>r 


+  f(x)xp(x,t)dy(t)dx 

(0,®) 


Since  this  must  hold  for  all  *  e  3 


1  5 


dPa'^  =  2  3x  p^x,t^  ^x=0  dt 


p(C,t)  *  0 

dp(x,t;  =  X  ?(x ,t)dt  ^  xp(x,t)cy(' 

*■ 


ne  white  ncise  version  or 


IS  , n  ^ o c  r 
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In  (5.35),  let  A,  rather  than  A  ,  denote 

,  .2  }  7 

jr  — j  -  j  x~»  to  emphasize  that  the  functions  A  acts  on  are  now 
3X  + 

defined  only  on  IR  .  The  estimation  algebra  is 


A '  =  (A,  x,  1} 

which,  but  for  the  fact  that  the  domain  of  functions  is  different, 
is  the  same  as  A  in  example  1  o~  section  5.1.  Again,  one  might  hope 
that  p(x,t)  is  solved  by 


9-1  (t)A  g2(t)x  g,(t)£-  g.(t) 

P(x,t)  =  (e  e  ^  e  dx  e  4  pQ)(x),  (5.39) 


once  the  various  operators  are  properly  defined.  However,  the 

crucial  identity  (I)  will  fail  in  this  case.  The  problem  is  the 

boundary  condition  p(0,t)  =  0.  Because  of  this  condition,  etA  will 

tAg 

net  be  the  same  as  e  of  the  previous  section.  Rather,  to 

M 

meet  this  condition  in  (5.39),  we  must  require,  (e'"  j)(0)  =  0  for 

all  relevant  ?.  A  simple  reflection  argument  on  the  kernel  G(x,y,t) 
tAQ 

of  e  (see  5.16 )  yields 


(etA?)(x)  =  ^Gj(x,y,t)p(y)dy 


Gl(x’y’;)  =  'TTfkt  €X?[-  fCOtht(xVj:sinh(4^J. 


:  5.40) 


we  try  to  prove  the  analogue  cf  (5.51)  with  A  reclacinc  ,-/e 
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then  get 


(etAx?)(z)  =  cosht(xetA?) (z)  +  sinht(4-  etA?)(z) 


+ 


/  T  -1/2  cotht(z2+y2) 
■rsinht  e 

0 


e-^/s1nht»(y)dy. 


The  last  tern  will  not  be  identically  zero  unless  $  is,  and  hence 
(I)  fails.  The  Lie  algebraic  calculation  of  section  1  to  solve 
Zakai's  equation  will  then  not  work. 


Discussion 


In  the  above  calculations,  we  verified  or  disproved  the 

tA 

crucial  identity  (I)  by  using  the  explicit  formulae  for  e  and  e 
It  is  desirable  to  explain  the  results  at  a  more  fundamental,  operator- 
theoretic  level,  i.e.,  to  understand  how  the  closely  related 
estimation  algebras  A  and  A'  involve  such  widely  variant  behavior. 

Our  first  step  is  to  reprove  the  identities  (5.13)  and  (5.19) 
of  section  5.1  by  much  more  fundamental  methods.  This  proof  will  not 
be  quite  as  strong  as  that  of  lemma  5.1  since  it  will  apply  only  to 
restricted  set  of  functions.  Recall  from  example  2  of  section  5.1  that 

O'  =  (finite  linear  combinations  of  eigenvectors } 
is  a  dense,  invariant  domain  ~cr  A.  Let  v  t  D'  be  an  eigenvector 
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with  eigenvalue  X.  From  the  commutation  relations  (5.10) 


[Aq,x]!J»  =  AqX  -p  -  XXip 


Xip  ~  [Aq, 


By  adding  (5.40)  and  (5.41)  we  derive 


(5.41) 

(5.42) 


A  (— 
Vdx 


x)*  -  A(^-  +  xb 


Since  O'  is  invariant  under  V  and  x,  +  xb  e  O'  also.  Thus 

we  conclude  that  +  x)<p  is  an  eigenvector  of  Aq  with  eigenvalue 

X  +  1.  A  similar  argument  shows  that  -  x  is  an  eigenvector  with 

eigenvalue  X  -  1.  The  following  calculation  now  proves  (5.18)  for 

tA 

?  =  -p.  Observe  first  that  e  y  =  e" p, 


e  +  =  et(ATl)^  +  XH>  etc*  Then 

tA0  tAOlr,d  .  ,  ,d  v 

e  x  -p  =  e  ^dx  +  xb  -  (^-  -  xb] 


1  -t(x+l )  ,d 


-  2  6 


V  +  X)'^  "  0  6 


1  aT(X-l )  ,d 


ldx 

t  A 


(^7  x) 


2  vdx  ’  A'J 


=  cosht  x  e'"\’  +  sinnt  — 


dx 


sinnt 


cosht  x  e 
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By  linearity,  (5.43)  extends  to  all  the  elements  of  D'.  Thus,  the 
invariance  of  the  domain  O',  which  is  also  a  domain  of  vectors 
analytic  for  A,  implies  identity  (I)  after  a  direct  calculation  using 
little  more  than  the  commutation  relations.  The  development  here  is 
reminiscent  of  the  quantum  field  theory  of  the  harmonic  oscillation 
or  the  construction  of  the  free  Boson  field.  This  is  not  accicental, 
because  deep  connections  between  quantum  field  theory  and  filtering 
exist.  (This  was  discovered  and  treated  by  Mitter  [35].) 

These  nice  domain  and  eigenvector  properties  of  the  Brownian 
motion  signal  case  do  not  extend  to  the  absorbed  Brownian  motion 
problem,  despite  the  isomorphism  between  A  and  A1.  The  root  cause 
is  the  boundary  condition  p(0,t)  =  0,  and  the  fact  that  this  condition 
is  not  invariant  under  Indeed,  we  can  see  intuitively  that 

e^Xi  f  cosht  x  e^%  +  sinht  ^  (5.44) 

because,  for  general  $  (e^^O)  =  0  and  sinht(^-  etAo)(x)  f  0. 

However,  the  fact  that  (I)  fails  is  not  apparent 
directly  from  the  structure  of  A'  because  the  boundary  condition  makes 
no  contribution  to  the  definition  of  A'.  We  can  rectify  this 
situation  by  more  careful  attention  to  the  issue  of  operator  domains. 

It  is  useful  to  think  of  as  a  representation  of  a  tie  algebra 
on  a  function  soace  7,  and  in  this  oiscuss’on  *'t  will  suffice  to 
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2 

set  V  =  L  (S),  S  =  state  space.  Recall  that  a  Lie  algebra 
representation  required  not  only  an  algebra  of  operators,  but  also 
a  dense,  common,  invariant  domain  on  which  to  define  them,  because 
an  unbounded  operator  is  not  fully  specified  until  its  domain  is 
given.  Thus  it  is  actually  inadequate  to  discuss  estimation  algebras 
without  considering  domains,  and  thus  we  attempt  the  more  rigorous 
formulation 

Definition  5.2  Let  D  c  V  be  a  common,  dense,  invariant  domain 

of  LQ  and  h.  Then  {LQ ,h Q  denotes  the  lie  algebra  of  operators 

generated  by  LQ/D  and  h/D  and  defined  on  the  domain  D. 

Remark  Domain  invariance  insures  that  all  brackets  [A, 3]  of 
elements  of  the  lie  algebra  are  again  well-defined  operators  on  D. 

What  is  the  correct  domain  D  to  associate  to  A’  when 
trying  to  solve  (5.35)  by  the  Wei-Norman  product  series  (5.39)?  Up  to 
now,  the  discussion  of  operators  in  a'  has  been  formal  since  we  did 
not  specify  domains.  However,  we  did  find  that  A,  in  conjunction 
with  the  boundary  condition  p(0,t)  =  0,  gives  rise  to  the  semi¬ 
group  etA  defined  in  (5.40).  The  proper  domain  O'  for  A  is 
then  that  for  which  A 1 D '  is  the  infinitesimal  generator  of  etA. 

For  clarity,  denote  this  infinitesmal  generator  by  A  .  It  is 
easily  seen  that 
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2 

D( A)  =  (<f>  e  L^(]R  )  j  (j>  - — ^  ~  \  el-^QR  )*o> ( 0 )  =  0} 

3X 

Henceforth,  we  will  discuss  A  instead  of  the  less  well-defined  A. 

Eigenvectors  of  A  will  again  be  important,  and  domain  considerations 

enter  into  their  definition;  is  an  eigenvector  of  A  if 

(j5-*-  -  1  x  )*  =  4)  for  some  x  and  if^(0)  =  0. 

*  dx^  c 

Now,  in  analogy  with  Lie  algebra  representation  theory, 

we  want  the  domain  D  of  A1  to  be  such  that  AID  generates 

tA  £  a 

e  also,  (in  the  sense  that  e  is  the  unique  semigroup 

s.t.  1  im  7  [etA4>-$]  =  A<p ,  y  i>  e  D).  At  the  very  least  this 
t+O  r 

requires  that  DtLD(A).  Otherwise  A ] D  will  generate  a  different 
semigroup  or  will  have  extensions  generating  different  semigroups. 

For  arbitrary  D,  of  course,  it  may  not  be  possible  to  associate  any 
semigroup  to  A { D - 

However,  the  next  theorem  will  show  that  the  requirement 
D  CD(A)  is  also  problematic  and  it  will  lead  us  to  a  deeper 
characterization  of  why  the  Wei -Norman  method  fails. 


■*  9  4. 

Theorem  5.2  Let  0c  C(a)  be  a  dense  (in  L  (E  ))  invariant 
domain  for  Then  D  does  not  contain  any  eigenvectors  of  A 

nor  does  it  contain  a  c_s:e,  invariant  domain  o£  analytic  vectors. 
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Remark  It  can  be  shown  that  A | D  has  many  different  self-adjoint, 
negative  extensions.  Thus  there  will  exist  many  other  semigroups 
U(t)  such  that 

lim  j  [U(t)<j>-i]  =  A 4,  Vp  c  D 
t+0  1 

We  conclude  that  0  does  not  have  the  structure  necessary  for 
integrating  the  elements  of  a'. 

Proof  If  ip  is  an  eigenvector  of  A  it  is  clear  that  i^(0)  =  0  and 

1  1  2 

ib'(O)  t  0,  for  otherwise  the  unique  solution  of  ^p"  -  j  x  <p  -  xp  =  0 
(x  =  eigenvalue)  is  p  =  0.  However,  if  D  C  D(?0  is  invariant  under 
A',  it  is  at  least  invariant  under  Thus  if  $  z  D,  e  DcD(A) 

V n  which  implies  =  0,  Vn-  Thus 

D  C  DCA^r^e  C*(1R+) ,  t>(n)  (0)  =  0,  Vn}  . 

It  is  immediately  clear  that  D  contains  no  eigenvectors  of  A. 
Consider  a  representation  of  the  Heisenberg  algebra  A"={-ix,~,-i } 
on  D.  As  in  example  3  of  section  5.1,  i^  is  not  essentially 
self-adjoint  on  D  and  hence,  by  the  theorem  of  Nelson  [37]  (see 
Appendix  2)  A"  on  0  does  not  integrate  to  a  unitary  group  and 
hence  does  not  possess  a  dense  invariant  domain  of  analytic  vectors 
in  D.  An  analytic  vector  for  ix  is  an  analytic  vector  for  x 
and  vice  versa,  so  D  does  not  contain  such  a  domain  for  a '  either. 

In  short,  the  eigenvectors  of  A,  ,.r. 'ch  in  the  5ro;:niar, 
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signal  example  were  the  source  of  a  dense,  invariant  domain  of 
analytic  vectors,  are  no  longer  invariant  under  the  estimation 
algebra.  Thus  if  (j  is  an  eigenvector  of  A,  Atj,  =  .o ,  it  is 
no  longer  true  that  +x)-h  and  (~  -x)y  are  eigenvectors  of  A 
and  hence  the  proof  of  (I)  on  eigenvectors  by  commutation  relations 
also  fails  for  the  absorbed  Brownian  motion  case. 

In  summary,  when  a  dense  invariant  domain  of  analytic 
vectors,  in  these  cases  provided  by  the  eigenvectors  of  A  or 
Aq,  fails  to  exist  in  the  domain  of  the  estimation  algebra,  the 
Hie  algebraic  method  of  solving  Zakai's  equation  does  not  work.  We 
shall  see  this  same  behavior  repeated  in  examples  presented  in  the 
next  section.  It  is  our  conjecture  that  the  existence  of  analytic 
vectors  for  the  domain  of  an  estimation  algebra  will  be  a  necessary 
condition  that  a  filtering  problem  with  finite-dim.  estimation 
algebra  also  have  a  finite  dimensionally  computable  conditional 
density.  Further  work  on  this  has  not  been  done. 

5.3  Finite  Dimensional  Estimation  Algebras. 

In  this  section  we  seek  to  identify  those  filtering 
problems  that  possess  finite-dimensional  estimation  algebras, 
restrict  our  attention  to  the  class  of  models 

dx(t)  =  f { x ( t ) )ct  +  3  db(t; 

cy(t)  =  nvVt) ;  -  bw(c,  . 


We  wi 


'5.-5 
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and  we  will  assume  that  x(t)  is  an  ]Rn-va1ued  process,  y( t)  is 
IR^-valued,  b ( t )  is  an  m-dimensional  Brownian  motion,  m  _>  n  and  3 
is  a  constant  nxm  matrix  of  full  rank.  Additionally,  we  will  suppose 
that  x ( t )  evolves  in  an  open,  connected  set  U  c  and  that 
f,  h  e  CT(U).  As  the  estimation  algebra  domain,  we  will  always  take 

Cj(U)  *  {o  c  C"(U)|supp<i  is  compact}, 

a  choice  avoiding  boundary  conditions,  but  imposing  no  loss  of 
generality  to  estimation  algebra  calculations,  since  the  algebra 
operators  should  be  defined  on  a  sufficiently  well-behaved  and  comDlete 
domain.  The  first  result  will  present  a  necessary  condition  that 
(5.45)  has  a  finite-dim.  estimation  algebra.  We  then  use  this 
condition  to  list  all  possible  finite-dim.  examples  in  the  scalar 
version  of  (5.45),  n  =  m  =  p  =  1.  Finally  we  discuss  in  which  of 
the  scalar  possibilities,  Zakai's  equation  can  be  solved  by  the 
method  of  section  5.1. 


Conditions  for  Finite  Dimensional itv 


The  Zakai  equation  for  (5.45)  is 


3p(x,t)  . 


=  Lnp(x,t)  +  (  J  h.(x)y.(t))p(x,t) 
0  i=l  1  1 


1  -  , 

2  A  /ij 
i  ,J  =  1 


n 


,  P 


2 ■  , 


- — - "  ~ —  £  .  ( x )  -  4-  h  f '  x ) 

?x,  ;X .  .  =  ,  ;:< .  2  > 

l  i  t  “  I  I  .  “  i 


'5.4? 
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Consequently,  the  estimation  algebra  is  defined  to  be 
A  *  {L0,hr"‘,hp}LA’C“(tj)  • 

The  following  lemma,  which  incorporates  an  important  Lie  algebra 
calculation,  is  needed  in  the  first  theorem. 

Lemma  5.2  Let  g  e  C*(U).  Then  vk 

[adL0]kg-  I  Ok9(fl  —.A  (5.47) 

V*’V]  1  k  *i  *k 

+  terms  with  lower  order  differential  operators 

In  (5.47)  0!cg(-'-)  denotes  the  kth  differential  of  g  considered 

as  a  symmetric,  k-1 inear  function,  and  A£  denotes  the  column 
of  A. 

Proof  For  k  =  1,  a  direct  computation  will  show 

,  L  Sr  »,J  Sr  tr  (A-O^g)  -  7g-f] 

1  9 J”  l  1  J 

for  <p  e  cq(U).  In  the  last  term, 

02g  =  [02g(ez,ek)]  , 


=  kth  standard  basis  vector 


and 


79  l3X1  ’  ’?xn  ' 


HI 


For  k  >  1,  the  proof  proceeds  by  induction.  The  details  .-/ill 
not  be  presented. 


The  next  theorem  gives  the  necessary  condition  for  toe 
model  (5.45). 


Theorem  5.3  If  dim  A  <  «,  h^ (x) , • • • >hp ' x ' 
of  degree  £  2.  .More  generally,  if  g(x)  c 
must  be  a  polynomial  of  degree  <_  2. 


must  be  polynomials 
is  in  A.  g(x) 


Proof  Fix  g(x).  The  sequence  {[ad  is  contai'neG  'n 

A  and  hence  cannot  nave  operators  of  arbitrarily  high  order. 
Because  of  lemma  5.4,  this  implies  that,  for  some  k. 


skg(A  » •  •  •  .A  )  =  0 

1 1  k 

for  all  4-]  ,  •  •  • » 1  1  i-] .  •  *  *  ,it<  in. 
this  means  that 


Since  A  is  non-singular, 


g  =  0 


for  all  z^  , • • • ,z^  1  1  ,  •  •*  in.  Thus  g  i  s  polynomial  of 

degree  k  -  1 . 

The  sequence  of  functions  •' a„ ( x  , 

n  n-j 
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aQ(x)  =  g ( x ) 

3-|(x)  “  CL>-g>ag(x)j,  2gi.x)j 

an(x)  ‘  [[L0'an-l(x)]’  an-l(x)] 
must  also  be  in  a.  Another  tedious  calculation  shows  that 
an(x)  =  7an_1(x)  A  7Tan-l(x)  . 

If  g(x)  is  a  polynomial,  then  clearly  {an(x)}*_g  is  a  sequence  of 
polynomials.  We  claim  that 

deg  an(x)  =  2  deg  an_](x)  -  2  .  (5.48) 

To  prove  this,  observe  A  >  0.  Hence,  there  exists  a  matrix  S 
such  that 

SAS '  =  diag  [>.,,•••  ,An],  >0  1  *  1,  •••  n 

so  that 

an(x)  =  7an_1 (x)S-^  SAST(S"^ )T7Tan_i (x) 

n  -17 

=  7  (7a  ,(x)S  ')T 

i=l  n_1  1 

But  7a  2  '  will  oe  a  /ector 

n-  i 

of  wni  cn  will  be  v dec  a  .  -  I. 

n-  . 


oclyncmial s ,  one  hignest  cecree 
’  = . i£  •  clearly  "bl  lews  -'Z~  t.v  $ . 
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Now  suppose  deg[g(x)]  >  2.  (5.4 8}  will  then  imply  that  the  sequence 
deg  an(.<)  will  increase  without  bound  and  so  admit  polynomials  of 
arbitrarily  high  order  into  A.  But  this  cannot  happen  if 
dim  A  <  Therefore  deg  g(x)  <_  2. 

In  the  scalar  case,  theorem  5.3  may  be  used  to  impose 
conditions  on  the  drift  f(x)  and  so  to  obtain  n.a.s.c.'s  for 
fini te-dimensional i ty . 

Theorem  5.4  Let  n=m=p=l,G=l.  Then  dim  A  <  »  iff 

( i )  h ( x )  =  ax  +  3 
f  +  f2  3  ax  ^  +  bx  +  c  . 

(ii)  h(x)  =  ax"  +  3x  +  y,  a  t  0,  and 

f  +  f2  =  -  h2  +  a(2ax-s)2  +  b  +  — - - 

(2ax+s) 

or 

2  2  2 

r‘  +  f  =  -  h  +  ax  bx  +  c 

Proof  Assume  dim  A  <  ®  and  h(x)  =  ax  +  s.  The  function 
r^x)  =  [Lq  i  [Lq  »h]  ] 

2  1 

=  a~x  +  f"  +  ff 

is  in  A.  Hence  by  theorem  2, 

2,2, 

r.  ix;  =  ex  -  b/2 

t 


(5.49) 


(5.50) 


for  seme  constants,  e,  a,  and  b.  "Thus 


iCf+f2]'  =  1 


ff  =  ex  -ax 


-  b/2  , 


and  hence 


Likewise 


2  2  3  2 

f‘  +  t  =  j  ex°  +  ax^  +  bx  +  c 


r2(x)  =  [Lg,  [Lg.Ti  (x)]]  -  4eLf- 


(5.52) 


=  2e(2:tx+3)^  +  2e[f'xf~]  - 
( 2ex+a-*-a^ )  [■^'f " -^f" ( x/+f)j 

is  quadratic.  But,  by  substituting  (5.52)  into  (5.53)  we  find  that 
r^(x)  contains  the  term  e  [y]x  .  Hence  e  =  0.  Thus  f‘  -  f  is 
quadratic.  Conversely,  if  f(x)  satisfies  (5.52)  with  e  =  C, 


A  =  Span  {L0,  x,  1}  , 


which  is  finite-dimensional. 


Next  suppose  that  h(x)  =  axd  -  £x  x  v,  -  f 


Agai  n 


>*(x)  =  [Lg  ,  [Lg,h]]  -  4aLg 


=  2aCf ’  +f2]  -  \  (2ax+£)[f,,J-2f,f:  -  2ah2 


,  f  r*  .  \  i  | 

-  \2xx-3,nn 


is  a  ouadratic  ‘unction  C ( x  ) .  Let  c  r  -  f.  ke  see  t.nat 
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satisfies 

2ctz  +  ^-(2ax+3;z '  =  -  2^h"  -  [22X+£)hh'  -  Q(x)  .  (5.54) 

The  general  solution  z  of  this  equation  is 

z(x)  =  -  h^(x)  +  Q,(x)  +  - - —  ? 

1  (2ax+S, 

where  Q-j(x)  is  another  quadratic  function  solving 
2aQ1  +  I(2ax+5)Q^  =  Q(x) 

and  x(2aX+3)*“  is  a  solution  of  the  homogeneous  part  of  (5.54). 

If  Q-.(x)  f  u(2ax+s)  +  v,  then  by  taking  an  appropriate  linear 
combination  of  Q(x)  and  n(x)  we  may  show  that  x  z  A.  It  then 
turns  out  *^at 

[LqCLq.x]]  =  Q-j  (x)  -  2.\ct(  2ax+S ) 3 


is  in  A.  But  this  must  be  quadratic  and  hence  =  0. 

Conversely  if  Q-j(x)  =  ’_(2ax-rS)“  +  u,  then  X  may  be  non-ze^o 
and  the  estimation  algebra  is 


=  Span 


av(2ax- 

u 


9 

-3);  (2: 


./-£ - 

cx 


; \f!  T- 

\  -  — A  -  ,  '  j  5  *■ 


.  2  ,  2 

n  f  ax  1  ox  *  c  anc 


soecial  form 


is  net  of  the  above 
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A  =  Span  {L-jX^.x,  x- -  xf,  4 -  f,  I' 

0  CX  GX 


This  completes  the  proof. 


Remark  Case  (i)  compares  to  the  Benes  [  1  ]  examples;  a  form  of 
(ii)  appears  in  Brockett  [6]. 


3y  a  simple  transformation,  we  may  extend  this  result  to  the 
filtering  model 


dz(t)  =  f(x(t))dt  +  g(z(t))db(t) 

(5.56) 

dy(t)  =  h(z(t) )dt  +  dw(t) , 

and  h,f,g  sC"([j),  U  is  an  open  interval  of  R,  and  g(z)  >  0  for 
z  s  U.  Consider  the  differential  equation 

• fc'(x)  =  g( e (x ) )  3(0)  =  zQ  c  U. 

Let  I  denote  the  maximal  interval  about  x  =  0  on  which  the 
solution  e(x)  exists.  3  maps  I  onto  U,  is  infinitely 
differentiable  on  I,  and  is  invertible.  These  statements  are  easily 
demonstrated  from  differential  equation  theory.  Next  let 

f(:<)  =  — Cf(9(x))  -  Je-(x)] 

and  suppose 

dx( t)  =  f  ( x ( t ) ) tit  *  db ( t )  . 

By  applying  Ida's  rule,  or.e  -'ires  that  i  *  :(t)  =  ~ 
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dz(t)  =  f(z(t))dt  +  g(z(t))db(t)  . 

Hence,  (5.55)  may  be  replaced  by: 

dx(t)  =  f(x(t))dt  +  db(t)  (5.57) 

dy(t)  =  h(a(x(t)))dt  +  dw( t) . 


The  Lie  algebra  analysis  can  now  be  carried  out  on  (5.57).  The 

]  a  ^  9  ,  1  ? 

estimation  algebra  is  A-j  =  {j  2—^  9w(t)  -  rr  f( z)  -  j  h  (z)  ,h(z)  >LA, 

3Z 


r  1  3  3  ~c  I  \  1  /  L.  \  2 


that  of  (5.57)  is  A2  *  ij  — j  f(x)  -  ^(h-a)  ,h*a>LA  . 

3  X 


If  i : A 2  -*•  A-j  is  defined  by 


(i[S]-v)(z)  =  B**e(x) 


x=a_1(z) 


one  sees  easily  that  i  is  an  isomorphism  of  A-|  and  A^ .  Hence,  we 
derive. 

Theorem  5.5  dim  A-j  <  »  iff  dim  a?  <  =°. 

"heorem  5.5  says  that  any  "inita  dimensional  est'~ati:n 
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algebra  for  the  model  (5.56)  with  g(z)  >  0  can  be  reduced  by  state 
space  diffeomorphism  to  one  of  the  cases  in  theorem  5.4. 

Solution  of  Zakai's  Equation 

Which  cases  among  those  singled  out  in  theorem  2,  allow 
a  solution  of  the  filtering  problem  via  the  method  of  section  5.  ? 

To  answer  this,  it  is  first  important  to  characterize  the  drifts  f 
solving  the  equations  (5.49)  -  (5.51).  It  turns  out  that  f  may 
explode  for  finite  x,  that  is,  the  maximal  interval 
U  =  (rg,r, )  on  which  f  can  be  defined  may  be  only  bounded  or 
semi-infinite  because  ! f ( x )  [  -  »  as  x  -  bounded  endpoint  of  (J. 
Despite  this  a  signal  solving 

dx(t)  =  f(x(t))  dt  +  db(t) 

x(0)  e  U  (5.58) 

will  still  exist,  but,  in  general  only  up  to  a  stopping  time,  t, 
at  which  it  attains  a  boundary  point  of  U,  (Gihman  and  Skorohod 
[  17]).  The  theory  of  diffusions  on  bounded  intervals  must  now 
be  applied  to  proceed  further.  It  says  that  to  specify  x(t) 
for  t  >  t,  one  must  impose  conditions  that  tell  how  the  process 
behaves  at  the  boundary,  i.e.,  whether  it  is  absorbed,  reflected, 
terminated,  or  seme  combination  cf  these  three.  Always,  when 
x(t)  t  'J,  it  is  assumed  to  solve  (5.5c).  -"e  tneory  a’so 
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indicates  that  even  if  x(t)  does  not  hit  the  boundary,  so  that 
x(t)  can  be  defined  as  a  solution  of  (5.53)  without  additional 
conditions,  different  process  behavior  near  the  boundaries  can 
occur.  Whether  or  not  x(t)  attains  the  boundary  and  how  it  acts 
near  a  boundary  depends  on  the  nature  of  f. 

To  study  f  and  (5.58)  we  state  some  preliminary  results 
from  diffusion  theory.  In  our  definitions  and  statements  we  follow 
the  exposition  of  Mandl  [  27];  the  original  reference  is  Feller 
[  12].  Let  f  e  (U),  U=  ( rg , r^ )  and  consider  the  operator 

B  =  i  +  f  (x )  — 

?  dx2  f<x)  dx  • 

Let  r  e  (rQ,r.,)  and  define 

rX 

c(x)  =  2  j  f(s)ds 

-  r 

fx 

u(x)  =  j  dy  e"C'-^ 

J  r 

.x 

i 

i 

*  r 


d'j  e 


c(y) 


dz  e 


I* 


dz  e 


C(2) 


v(x)  = 


-  p  (  -*  ^ 


L 
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Oefi ni tion  5 .  3  For  -he  operator  3, 

The  boundary  point  r.  is 

(a)  an  inaccessible,  natural  boundary  if  -_ ( r ^ ) =J0 ,  ;(r^ 

(b)  an  inaccessible,  entrance  boundary  if  y(r.)=«,  v(r 

(c)  an  accessible,  exit  boundary  if  u(r1.)<*,v(r.)<». 

(d)  an  accessible,  regular  boundary  if  u(r1-),v(ri)  <  * 

Let  C(U)  :={*(x)  ,xe'J  |  o  is  continuous,  1  im  i/(x)  exists  and  is  fi 

x-t-ri 

i  =  0.1} 


and  consider  the  differential  operation  B 
on 

0(B)  =  {*  e  C(U)  |B  v  £CtU); 


Lemma  5 2 

i)  If  the  boundaries  of  U  are  inaccessiole  then  3  on  0(3) 
a  unique  Markov  semigroup  on  C(J) 

ii)  Suppose  3  has  an  accessible  boundary.  Define  D'(3)  as  the 
of  -it  z  0(8)  s.t.  at  a  regular  boundary  r^ 

0  =  p^  i(ri )-(-!  )1-t  i  lim  e  'X/  j(x)  + 


0, 


pi 


ni  te , 


eneraoes 


set 
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and  at  an  exit  boundary  0  =  p.-u(  r- )  +s  -3u( },  ,  p^-mr^C. 

Then  3  on  D' (3)  will  generate  a  Markov  semigroup  on  C  ( 'J) . 

Proof.  Mandl  [ 27  ] . 

Remark .  The  boundary  conditions  in  lemma  5.5  ii)  are  called  local. 
Mora  general,  "lateral"  condi tions, corresponding  to  having  toe  process 
jump  to  a  point  inside  U,  are  possible,  but  for  such  signals  Zakai's 
equation  no  longer  holds. 

For  a  domain  with  inaccessible  boundaries,  let  x(f,xn)  denote 
the  solution  of  (5.53)  with  x(C;Xg)  =  Xg  s  U .  Then  3  on  D(3) 
generates  (T(t)g)(xQ)  =  Eg(x(t;xQ))  for  g  e  C  ( TJ) .  Construction  of 
Markov  processes  that  correspond  to  the  semigroup  generated  in  the 
case  of  accessible  boundaries  with  local  boundary  conditions  may  oe 
found  in  Gihman  and  Skorohod  [  17 ]. 

We  shall  now  analyze  the  solutions  f  of  ( 5 .49 1 -{ 5 . 51 )  in 

tX 

terms  of  this  theory.  Given  a  function  f,  let  v  =  exp  f,  or, 
vice-versa,  given  v,  let  f  *  v/v.  Then  f  satisfies  (5.43),  15. EC)  o 
(5.51)  respectively ,  if  and  only  if  v  satisfies 


v 


C ;  v 
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or 

v"  =  (-h“  -  ax"  •)»  bx  t  c)  v  ( 

respectively .  Clearly,  singularities  of  f  will  arise  at  zeroes 
v.  The  situation  is  summarized  in  the  following  lemma.  Note  fn.at 
special  attention  must  be  given  to  ( 5 . 6G )  because  of  the  singular' 
of  the  coefficient  at  =  -s/2;T. 


r  -  A 

Lemma  3.  •f 


i)  Suppose  that  f  is  a  solution  of  (3.-9)  or 
becomes  singular  at.xQ.  Then  f(x)  =  ~ —  -  C( 

^  _  3 

holds  true  if  f  solves  (5.50)  and  xq  /  ^7. 

ii)  If  f  is  a  solution  (5.50)  and  c  _>  -  j, 
solution  on  an  interval  with  endpoint  xq  =  fr 


(5.51)  anc  that  f 
I)  as  x-x„ .  "he  sam 


then  f  can  have  a 
and 


■\x , 


X  — >  Xo 


where  a( 2-1 ) =c . 

iii)  If  c  <  •  j,  f  is  not  defined  on  an  interval  wit.r  endpcint 
or  containing  Xq . 

hen  V  1  ; x- (  =  c  (or  otnerw'se  •  5  1  ■ 

hut 
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v’(xn)+v“(xn)(x-x _)+••• 

f(x)  =  - 2 - 2_ - 2 - 

v1  (xQ)(x-xQ)+jv"(x0)(x-x0)  +  ■ 


=  ^z:  +  0(T) 


( i i )  It  suffices  to  consider  the  case  £  =  0  and  2  =  1  ,  for  which 


xQ  =  0,  and 


v"  =  (-h”  +  ax^  +  b  +  — j-)v  . 

x“ 


If  c  >  -  j  2(2-1)  =  c  has  two  solutions  3]  <  7  <  'J2  and  7  has 
series  solutions  near  zero 


n 

Vi U)  =  x  7  a  xn ,  aQ  f  0 
1  n=Q  n  u 


32  r  .  n 


v2(x)  =  x  >  anx  ,  bc  t 


n=0 


For  each  solution 


f  =  —  +  0(1 
x . 


^  — ►  0 


=  -  I/i,  t.ne  solutions  7  are  generally  0 *  tne  £or~, 


For  c 
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v,tx)  =  J  ax" 

1  n=0  n 

v?(x)  =  (in  x)  x1/2  (  l  a  xn)  +  x^2  l  b  xn 
c  n=Q  n  n=G  n 


Agai  n 

fir*"!'1  ■ 

(iii)  If  c  <  ~  jr»  the  solutions  v^  and  v ^  are  still  valid,  but 
now  ct-j  and  are  compl  ex 


a . 
1 


/j4c+l 

2 


Thus  sines 

1/2  z  1  4c+l i  1/2  :  i  i  »4c+l  in  x 

any  real  solution  v  will  have  an  infinite  number  of  zeroes  in  any 
neighborhood  of  zero.  Since  each  of  these  zeroes  will  correspond  to 
a  singularity  of  f,  f  cannot  be  well-defined  in  an  interval 
containing  C  or  with  0  as  an  endpoint. 
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From  lemma  5.4  we  can  derive 
Lemma  5 . c 

Let  f  be  a  solution  of  any  of  the  equations  (5.49)  - 
(5.51)  on  U  =  (rQ,r-|).  If  r.  is  finite  and  lim  )f(r. )  <  ®  ,  r. 
is  regular.  If  r^  is  finite  and  f(x)  =  ~~r  +  1 0 ( 1 )  (x-*r.)  r..  is 
an  entrance  boundary  if  a  >_l/2,  a  regular  boundary  if  a  e(-l/2,l/2), 
and  an  exit  boundary  of  a  <_  -  1/2.  If  r..  is  infinite  (+  »  or  ~») , 
it  is  natural  . 

Proof  For  the  finite  boundaries,  calculate  j(x)  and  v(x)  and 
apply  definition  5.3.  To  prove  that  an  infinite  boundary  is  natural 
it  is  necessary  to  know  hew  f  behaves  at  that  boundary.  It  will  oe 
shown  later  that  f  can  exist  in  a  semi -i nfinite  or  infinite  domain 
only  if  it  solves  (5.49).  Suppose  for  instance  that  r,  =  +  ».  It 
turns  out  that 

?  2 

f  +  f"  =  ax  +  bx  +  c 

whether  either  a>0  or  a  =  C,  b  >  0,  or  a=0,b  =  0,c>_0. 

In  any  of  these  cases  f(x)  can  grow  at  most  like  x  as  x-«°  and 

from  this  one  can  snow  limu(x)  =  lim  v(x)  =  +•*. 

X-k»  x-*o° 

Let  us  take  up  the  question  raised  in  the  last  proof; 
which  equations  of  ( 5 .49) -( 5 .51 )  nave  solutions  on  which  kinds  of 


doma ins? 
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Lemma  5.5 

Suppose  f  satisfies  (5.50),  (5.51)  or  f’+f =ax“-px-c 
and  either  a<0,  or  a=b=0,  c<0.  If  f  is  defined  on  U=(rg,r '■  and  has 

no  singularities  in  U,  then  U  must  be  bounded.  If  x  satisfies 

'  2  2  2 
f  +f  =ax  +bx+c  and  r.  is  infinite,  lim  ax  +bx+c  >_  0 . 

x-ri 

Proof .  The  proof  applies  the  Sturm-Liouvi 1 Ie  comparison  theorem 

(see,  e.g.  Coddington  and  Levinson  [  3  ]).  Suppose  f  satisfies 

rx 

(5.51).  Then  v  =  exp  I  F(x)  satisfies 

J  r 

v"  +  (h^-ax^-bx-c)  v=0 
Since  h  is  quadratic,  for  ! x ;  large  enough 

2  2  2 
h  (x)  -  ax  -  bx  -c  >  1  . 

Hence,  by  the  Stum-Liouville  comparison  theorm  v  must  have  a  zero 
between  any  two  successive  zeroes  of  any  solution  of 

5"  +  e  =  0. 

Since  t  =  c.cosx  +  C2Sinx  we  see  that  v  must  have  a  zero  in  any 
unbounded  domain.  ~hus  £  can  be  defined  without  singularities 
only  in  unbounded  domains.  The  other  statements  are  orcved  analogous 

The  interesting  ’-esults  contained  ;n  th-se  lemmas  zrc 
I  :  that  nc  ni  te  bcuncary  arising  *rom  an  -  solving  )5.-5  ,  .5.56) 
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or  (5. 51)  wi 11  be  natural  and  (2)  that  finite  boundaries  will  always 
be  encountered,  except  possibly,  with  (5.43).  It  will  turn  out  that 
t.nesa  non-natural  boundaries  will  generate  non-trivia!  boundary 
conditions  for  Zakai's  equation.  The  general  implication  is  then  that, 
just  as  in  the  absorbed  3rownian  motion  example,  these  boundary 
conditions  will  interfere  with  the  attempt  to  apoly  Lie  algebraic 
techniques.  It  would  then  follow  that  for  scalar-signal  mcdels 
of  the  type  ’5.53)  only  the  known  examol es  of  3enes  can  be  treated 
by  the  method  of  section  5.1. 

It  is  difficult  to  prove  this  statement  in  such  sweeping 
generality.  First,  there  is  a  problem  of  formulation.  If  x(t; 
sol ves  : 


dx(t)  =  f ( x ( t ) )  dt  +  db( t) 

on  a  finite  domain  'J  with  regular  accessible  boundaries,  it  is  no 
longer  possible  to  characterize  the  conditional  distribution  just  by 
a  density  p(x,t)  on  U;  one  must  also  consider  the  conditional  mass 
distributions  ?f.(t)  *  Eg’-1  ;x(  t)=r .  '-!~t  •  Ft '  r°r  3enera]  '0C3‘ 

boundary  conditions,  we  must  solve  a  system  of  equations  for  the  triple 

{?  (t),  p(x,t),  ?  (t)),  and  Lie  algebra  techniques,  if  any,  must  be 

r0  rl 

aoolied  to  this  system.  Thus  in  the  analysis  or  Lie  algebraic 
techniques  to  follow  we  will  avoid  the  accessible  boundary  case. 
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Remar k .  For  certain  boundary  conditions  in  the  regular  accessible 

case  the  system  (P  ,  p(x,t),  P  (t))  degenerates  into  just  a 
r0  rl 

density  equation  with  boundary  condition;  if  r-j  is  pure  reflecting, 

i.e.,  D '(B)  only  if  ~  =0,  then  P  (t)=0  and 

ax  x  ri  r. 

•;4 - f  ( X ) :  p  ( X ,  1 1  =0,  ft;  if  r.  is  pure  absorbing,  i.e.,  :  D'(B) 

o  x  x  -  r  l 

iff  3i(r.)=0,  tne,,  d"'r_(t)  =  [  P(x  ,t)  ix_r  ]dt  and  p(rj5t)  =  0,  Vc. 

(see  Pardoux  [4Q  ]).  Theorem  5.5  will  hold  for  these  cases,  but  we  will 
not  work  out  the  details. 

Secondly,  the  case 

f1  +  f2  =  -h2  +  a(2 zx+i)2  +  b  +  — -  -  c  t  0 

(2ax+s)  C  >_  -4 

on  an  interval  with  x^  =  -3/2a  as  an  endooint  poses  analytical 

difficulties,  both  because  of  the  variety  of  cases  f(x)  '  — - - ,  X— »X0 

x-xQ 

a  -:IR,  to  be  analyzed  and  because  of  the  singularity 

- 5  in  f'  +  f  .  In  the  interests  of  simplicity,  we  will  not 

2ax+r. 

treat  this  case. 

From  now  on,  we  will  be  interested  in  solving  Zakai's 
equation  for  the  problem  of  filtering  a  signal  xft)  solving  ,5.58! 
under  the  assumptions: 
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160  - 


'/ie  give 
For  g  £ 

If  g  £ 

(Kunita 


In  order 


a  brief,  formal  derivation  of  this  for  the  case 

'J 

:  C(U)  oefine 

^r1 

fft(g)  =  E0‘*9(xt)Lt!F£}  3  !  g(z)p(z,t)dz  . 

jro 

0(3), 


dot(g)  =  -t ( 3g ) at  +  ?t(hg)dy(t) 

[25 j).  3y  integrating  the  term  ?t(3g)  by  parts  we 


dct(g)  =  f  g(z)dp(z,t)  dz 
-  rn 


1 


g(z)[B*p(z,t)dt  +  h(z)p(z,t)dy(t)]dz 


\  eC  ^X;g'  (xje^  ^p(x,t)  | 


rr 


g ( x ) Cf^  -  2f(x)]p(x,t; 


rr 


ro+ 


t.nat  this  hold  for  general 


2  77'i7:'(x)  • 

3x 


1(3),  we  ^eouire 


-P(x,t ;  =  2+p(x,t)ot  -  h(:<;p(x,t;cy(t 


m*., 
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Tim  g1 (x)p(x,t)  =  0 


!  .  co , 


x-ri 


lim  g(x)[f-  -2f(x)]p(x,t)  =  0  . 

x-r . 

I 


:or  g  £  D(B)  one  can  easily  show  that 


lim  (f-  -  2f(x)}p(x,t)  =  0 
x-r. 


f  ~ 

\  ~  ' 


implies  both  (5.57)  and  (5.65).  When  writing  (5.65)  with  y(t)  instead 

2 

of  dy(t)  one  must  add  the  Wong-Zakai  correction  term  -1/2  h  (x;  to 
3*.  This  proves  (5.52)  -  (5.63). 


Rather  than  work  with  (5.62)  directly  it  is  convenient  to 


invoke  the  "aauge"  transformation 

r* 

-e/ 


eF(x)  =  e‘r 


i  „r(z)dz 


r  t  (r0.r,) 


and  write 


p(x,t)  =  e~r 'x,'q(x,t) 


A  calculation  shows  that 


so  .1 


*  -:4  j  :* 1  ;q( x , t)  -  y ( t ) n ( x ) a ( .< , t ; 


it 


\=-cc. 


-  1*2  - 


Hm  e^'x^  Ctt  -  f(x)J  q(x.t}=0  if  r.  is  an  entrance 

w,  3  1 

boundary 


(5.69) 


It  is  this  equation  that  we  seek  to  solve  by  Lie  algebraic 
2 

methods.  Let  A  =  j  -  |-[f,+f2+h2]  and  call  { A , It } LA  the 
algebra  of  (5.68).  is  isomorphic  to  {LQ,h}LA  by  the 

map  B  -*■  e  F  8  e  F  from  {A,h}^  to  {Lq,H}^.  This  map  also  trans¬ 
lates  evolutions;  that  is  if 


tA_, 

lim  [•-—■-  ] <<,  =  A 4  $  e  0 

UO 


lim  l[eFetAe  F-i]*  =  l  e  eFD 
UO  C  U 


Thus  the  lie  algebra  analysis  may  be  carried  out  on  (5.68). 


rhe  technique  we  want  to  explore  is  that  of  solving  (5.68) 


9i ( t)A  g?(t)X„  gri(t)X. 
q(x.t)  =  (e  1  e  2  •  •  *e  d  d 


qQ)U)  . 


(5.70) 


9i(t)A 

We  place  e  first  in  this  series  because  we  need  an  evolution 

that  must,  in  general,  satisfy  boundary  conditions.  It  is  not  usually 

possible  to  do  this  with  first  order  operators,  and  A  is  of  second 
order.  Different  second  order  operators  in  {A,h}^  might  be  chosen 
instead  of  A,  but  this  will  make  no  difference  in  what  follows. 


1 


_  1  54  - 


U  is  bounded  or  semi-infinite 

(i)  -A  is  self-adjoint  and  bounded-bel cw  on  3(A)  and  hence 


generates  a  semigroup 


Also,  A  has  a  discrete  spectrum. 


(ii)  Let  D'(A)  =  {finite  linear  combinations  of  eigenvalues  of  A} 


os  n 

J  v  r  (  F  ~j*  [sd  A]  h)e  •>  ror  -?  z  O' (A), 
n=0  n’ 


f  0  . 


(5.71) 


Proof  Assume,  without  loss  of  generality,  that  h(;<)  =  x  in  the 

2 

linear  case,  and  h(x)  =  x  in  the  quadratic  case.  It  is  easiest 
to  prove  the  theorem  separately  for  the  cases  U  bounded  anc  J 
semi-inf ini te . 


Let  (J  be  bounded,  then 


A  =  '  i [fl  +  f2  +  h2] 

L  dx  c 

1  0  "7 

and  ^[f ' +f“+h^l  is  a  bounded,  C  *  "unction  on 
that  (H)  excludes  the  case  in  which  ri  +•  a 


U.  (Recall 


that  t,H)  excludes  the  case  in  which  *  +•  *  may  have  a  singularity 

in  U.)  A  standard  calculation  from  c  i  *  *e  rer.  t '  a  ;  eo  -  at  *  on  tree  ry 

(Coddington  and  Levinson  [3])  shows  that  A  is  self-adjoint  on 

D(A)  and  -A  is  bounded  below.  Further,  the  theory  of  sel "-adjoint , 

differential  operators  on  bounded  interval  s  imp!  •* es  that  a  on  2{-] 

has  a  discrete  soectrum  \  ,  s.t. ,  \  >  •>  _•  ,  1  in  v  s  -  3r- 

n  n  n-  i  [ 

^—30 


anc  a  corses ocr.ci no  ccmc.eoe  seo 


,  tA  w  \  f 
(a  -?)(x)  =  | 


C  l  e  n  (x)v  (y)]o(y)dy 
n=0  n  n 


(5.72) 


(Coddington  and  Levinson  [8  ]).  One  now  proves  (ii)  case  by  case. 

2  2  2  2 

To  illustrate  we  do  the  example  h  =  x  and  f'+f  =-h  +  ax 

The  estimation  algebra  {A,h}.  ,  is  then  spanned  by  {A,x’,x  — ,  1} 

Ln  *  uX 


and  has  commutation  relations 


[A,  x2]  =  2x^+1 


[A,  x  |j]  =  !A  t  2a  x2  *  c 


From  these,  one  readily  derives 


a o 

l  [ad  A]nh  =  j  [cosh  2/at  -  1  ]A  +  [cosh2v'at[x~ 


+  ^  (sinh  2/at)x  t-  -  g(t) 
/a 


g(t)  =  — —  sinh  2/at  +  cosh  2/at  -  ir 


2  /a 


2a 


2a 


'low  'et  us  check  '5.72;  on  „■  .  Observe  first  that 

n 


wa  n 

O  =  3 


1  56  - 


and  ^(r.)  *  iSthervn  se 


V  ri 


'n  '  :  ■ 


'n  s0''- 


n  us 


wee : d  ' ~D . y 


»  ,n  _ 

r  t  r  ,  1  ^  '  ,«A  /  ' 

,  —r  [ad  A]  n , a  yj.x, 


n=0 


n  ‘ 


n 


^  T 

=  e  n  a_l/^(sinh  2/a  t)x  ^(x) 


5.7; 


1  0  ^  • 

>na  (cos  2--a  t-1  )-g(t)- x-cosh2/a  t)e  n  vn  ( x )  . 


This  will  not  satisfy  the  boundary  conditions  :(r-)  =  0  ,i  =  2,1 

*  1 

because  of  the  ;T(x)  tem.  ~hus  (5.73)  cannot  equal  eLrh-„  since 


tA  a 

by  (5.72),(e  \)(r.)  *  0,1  *  0,1.  ‘low  let  u(x)  *  /  a  t  ,  M  < 

1  n=l  n 


'her, 


co  n 

(  7  [ad  A jnh )  evA  v  (r  ;■  =  0.  i  =  0,1 
n=0  n- 


only  if 


•  3  , 


j=Q  - 


(see  5.73).  Since  the  A  •  are  unequal  this  cannot  happen  unless 

J 

a-  *  0,  j  =  1  ,  •  •  * , n .  The  statement  analogous  to  (ii)  with  h  replac 

by  2x  —  is  also  true,  "he  othe^  cases  '.nvol/ina  bounced  J  are 
J  ax 

oroved  in  the  same  manner. 


■  i  n ' te . 


'lew  let 


-s  .vas  srewn 
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2  > 

case  can  only  occur  if  f  +  f  =  ax"  +  bx  -  c  and  a  >_  0  at  the 
very  least.  Hence  the  operator  A  is  of  the  fom 

j  -  i  [x^  +  ax"  +  bx  *  cl 
c  dx^  * 

12  2 

and  1 im  »[x  +ax  +bx+c]  =  ®.  It  is  then  known  that  A  is  self-adjoint 
x-*®  ? 

on  0(A)  =  { ^ | A  v  t  L  (U),^(rg)  =  0}  with  a  strictly  decreasing 
sequence  of  eigenvalues  \ . .  The  analysis  then  proceeds  exactly  as 
for  the  absorbed  Brownian  motion  case.  (Indeed  by  a  transformation 
2  =  vx  +  5,  y'(t)  =  ey(t),  (5.58)  with  boundary  condition  q(0,t)  =  0 
becomes  the  Zakai  equation  for  absorbed  Brownian  motion),  "he  proof 
for  the  case  U  =  (-«,r^)  r,  <  =°,  is  the  same. 

■Remarks 

(i)  Theorem  5.3  implies  that  the  crucial  lie  algebra  identity  (I) 
of  section  5.1  fails,  and  hence  that  the  method  (5.70)  of  solving 
(5.63)  will  not  work.  The  result  is  that,  exclusive  of  tne  cases  net 
satisfying  the  assumptions  (H)  the  Lie  algebra  technique  works  only 
for  the  case  treated  by  Benes  in  which  r  is  a  global  solution  of 

f  +  f^  =  ax^  +  bx  +  c. 

(ii)  The  one  nen-rigorous  point  in  the  above  calculation  was  tne 
derivation  of  Zakai ' s  eouaticn.  However,  the  equation  ) 5 . £ 3 1  *or 
c(x,t ;  can  be  rigorously  derived,  i"  t re  :i*“2rentiaoiI  i ty  o*  q.*,t 
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is  assumed.  Indeed  if  (n,F,?)  is  a  probability  space  cn  which 
3(t)  and  y(t)  are  independent  Brownian  motions;  it  can  be  sncwn  by 
change  of  measure  techniques  that 

t  t 

q(x,t)  =  E  { exp [ j  h(xQ+b(s))dy(s)  -  j  !  [f '+f2+h2](xQ+b(s) )ds] 

0  Jo 

*  V<T>,FU3(t)+x0=x}r(x’t) 

where  t  =  inf{tjxg+B(s)  s  Uc>  and  r(x,t)  is  the  density  of 
B(t)  +  Xg  (Liptser  and  Shiryayev  [23]).  The  techniques  of  Zakai 
[47]  applied  to  this  object  yield  (5. 63), and  the  bounaary  conditions 
q(r1-,t)  =  Q  at  finite  boundaries  r.  are  clear.  The  details  will 

not  be  presented. 

It  is  in  trying  to  justify  Zakai 's  equation  rigorously  that 

the  singularity  - - - 0  at  x„  =  3/2 -  causes  trouble  and  is  wnv  we 

(2M+S)2 

excluded  the  case  from  the  analysis  of  theorem  5.5.  Nevertheless,  i * 

one  presumes  Zakai 's  equation  holds  and  applies  the  method  of  theorem 

5.5  with  careful  attention  to  what  happens  at  Xg,  the  same  conclusion 
tA 

about  e  h  will  follow. 

(iii)  From  the  remark  on  pace  /ff  ,  the  analysis  oc  theorem  5.5  car 
be  applied  to  the  case  when  any  ere  of  the  boundaries  *  s  reeds'*  arc 
either  instantaneously  reflecting,  on  purely  absorbing. 

In  tneorem  5.5,  as  in  toe  assorted  Brown: an  met* on  case  :* 
section  2,  : -  2  -  2d)  is  invariant  rn.ee'*  -  a**:  •* .  it  .•,*!'  -ct 
contain  an  invar* ant  set  o*  analytic  .estsrs  *;r  -  an:  n. 


In  our  analysis  of  Lie  algebraic  tecr.nioues  we. excluded  a 
number  of  possible,  finite  dimensional  examples  by  restricting  our¬ 
selves  to  problems  satisfying  hypothesis  (H).  However,  remarKS  (i) 
(iv)  state  that  the  results  of  theorem  5.5  are  true,  at  least  by 
formal  arguments,  for  any  excluded  case  for  which  the  conditional 
distribution  can  be  found  by  solving  Zakai's  equation  for  with 
(possible)  boundary  conditions.  Thus,  we  conjecture  that  no  ctner 
examples  beyond  those  of  Senes  can,  in  fact,  be  solved  by  the  method 
developed  in  this  chapter. 
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APPENDIX  1  HOMOGENEOUS  CHAOS  THEORY 


Wiener's  homogeneous  chaos  theory  provides  a  method  of  reore- 
senting  functions  of  Gaussian  processes  by  certain  infinite  expansions 
the  terms  of  which  are  the  prototypes  of  multiple  stochastic  integrals 
This  appendix  presents  the  fundamental  motions  or  the  theory;  the 
treatment  follows  that  of  Kal 1 ianpun  [21  ]. 

The  situation  is  as  follows.  Let  T  be  a  separable, 
topological  space.  {x(t)|t  £  T}  will  denote  a  mean  zero  Gaussian  pro¬ 
cess  on  T  with  a  covariance  function  c(t,s)  =  Ex(t)x(s)  that  is 
jointly  continuous  in  t  and  s.  Let  the  probability  space  of  the 
process  be  (f.,  F,  ?)  and  let  A  =  --{x(s)js  £  T}.  Homogeneous  chaos 
theory  concerns  itself  with  the  structure  of  L  (f.,A,P)  considered  as 
a  Hilbert  space  with  inner  product  <-,y>  =  E[:v].  -t  seeks  to  build 
a  useful  orthonormal  basis  for  L~(c,A,P). 


We  present  the  basic  construction.  Let 
N 

H  =  Spar,  {  )  a,x(t.)!t.  e  T,  N  <  »> 

T  1  1 

(' - '  denotes  closure  in  the  norm  of  L“(.*.,A,?)  1  . 

,  ...  .... 

of  L“(:.,A,P)  consisting  or  nnear  runctionais  or  < 

separable,  because  of  she  separability  c*  ~  arc  t.n 

c(t,s),  and  hence  H  has  a  counted  e  ortl-pr.cmi r,al 


t.ne  suosoac; 


ti nuni ty  c* 


— 


3,  =  { 1  > 

S  is  called  the  n'n  homogeneous  chaos  of  {x(t)'t  t  T; . 


1 


Evidently,  ?  =  3  G  ,  in  which  ©  denotes  a  direct 


n=0 


sum  of  Hilbert  spaces.  The 


G  thus  aive  a  Gram-Schrnict  tvoe 
n 


decomccsi tion  of  ?.  To  provide  orthoncrmai  cases  for  the  G^,  we 
introduce  the  Hernice  polynomials,  which,  as  it  will  be  seen  are 
naturally  associated  to  Gaussian  processes. 

Cefi ni tlcn  A .  1.2  The  n'n  Hermite  oolvncmial  is  defined  as 


hn(x) 


^  1 1  n  2  ,  ^  j  n  2  .  ^ 

'-I  x/2  d  -x/u 

■ _ '  n  1  _  ' 


v  n ! 


dx 


’heorem  A. .1.1  (Homogeneous  chaos  expansion) 


-or  ever/  n 
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APPENDIX  2.  SOME  CONCEPTS  FROM  OPERATOR  THEORY 

The  definitions  and  theorems  follow  Reed  and  Simon  [41,42 

Let  A  and  3  be  linear  operators  on  dense  domains  D 
and  O’  respectively  in  a  (complex)  Hilbert  space  H.  Let  (•,*)  denote 
the  inner  product  in  H. 

Definition  A. 2.1 


(1) 

A  is  closed  if  its  graph  {<$,Ao>j$ 

i  D;  is  a  closes 

subset  of  H  =< 

H  w.r.t.  the  norm  ' <o,p> | i  =  ijslj 

(2) 

3  extends  A  if  DC  D1  and  Be 

=  Ac ,  0  £  D. 

(3) 

Ajo  is  the  smallest  closed  extension  of  A,  assuming 

a  closed  extension  exists. 

(4) 

Spectrum  of  A  =  a(A)  =  t  -  {•'<;  a I -  A 

bijects  D  onto 

Definition  A. 2 

.2 

(1) 

D(A*)  =  [■»  t  H!  :  H  s.t.  (Aw, 3) 

=  f -  n 

>  9  .  *  -  -  w  •  * 

For  3  s  D(A*) 

,  A*3  = 

(2) 

A  is  symmetric  if  (Ay,:)  =  (y,Ac) 

for  ail  : , :  D. 

(3) 

A  is  self-aajoint  if  A  =  A*,  that 

i  sir"  A  is 

symmetric  and 

D( A*)  =  D. 

(4) 

A  is  essentially  sel ‘-adjoint  i~  - 

■  D  is  se! * -adjc • ' 

r<z'v.  -.2.1 

^  c-  1 '  *  ■ .  j  r  c  "2 

or  A  35' *- ao ' o 1 1 ,  -  -  T  .  r'ere  *s  a  ~ae  ■  “’■om  tre 
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boundec  Borel  functions  on  -(A)  to  bounceo  1’near  ooerators  on  - 
such  that 

«(0  #(g)  =  ?(fg) 

«0)  =  i 
[j(f)]*  =  *[f] 

and  if  n  (x)  -<■  x  pointwise,  !h  (x)i  <  :x!  for  all  x  and  n. 
n  n  — 

;  1  im  #(h„)y  =  Aa  V  b  £  D. 

n  ’ 

n-** 

1 

i  Exampl e  If  -A  is  self-adjoint  and  bounded  below  (i.e., 

s(-A)  (x,“),  X  >  ~»  ,  '^  =  e"^  =  ( e u x )  is  well  defined  by  the 

spectral  theorem  for  t  _>  0.  One  can  show  it  is  a  bounded  semigroup 
on  H. 

Definition  A. 2.3 

•i/  e  D  is  an  analytic  vector  for  A  if  An  u  D  for  all 
-  tn  n 

n  and  >  r  1  : A  i> :  i  has  a  oositive  radius  of  convergence. 
n=0  n- 

Theorem  A.2.2  (Nelson  [38]). 

Suppose  that  .A  is  symmetric  and  D  is  invariant  under  A 
and  contains  a  dense  set  of  analytic  vectors.  Then  A  is  essentially 
self-adjoint  on  D. 
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Let  G  be  a  finite  dimensional  Lie  algebra,  and  G  its 
associated  simply  connected  group. 

Theorem  A. 2. 3  Let  T(G)  represent  G  by  skew-symmetric  operators 
on  a  domain  D  of  H.  If  there  exists  X  e  T(G)  such  that  iX  (on 
0)  is  not  essentially  self-adjoint,  then  G  has  no  unitary 
representation  on  H.  Further  if  is  a  basis  of 

T(G),  x-]  +  "'+xCj  will  not  136  essentially  self-adjoint  on  D  either. 

For  the  notion  of  representation  in  this  theorem  see  definition  5.1 
in  chapter  5. 


